Loading paper
Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers | Tomesphere