Loading paper
Improving Vision Transformers by Overlapping Heads in Multi-Head Self-Attention | Tomesphere