Loading paper
Finite-Time Analysis of Gradient Descent for Shallow Transformers | Tomesphere