Symmetry Breaking in Transformers for Efficient and Interpretable Training