Refactor contact
Created by: lehr-fa
Merge #29 first!
- ContactHead now returns logits (symmetrized raw scores) instead of calling LogSoftmax
- Attention layers now return only the row attention maps (column attention maps are discarded)
Created by: lehr-fa
Merge #29 first!