Loading paper
Spectral Conditioning of Attention Improves Transformer Performance | Tomesphere