Loading paper
The Bayesian Geometry of Transformer Attention | Tomesphere