Loading paper
Geometric Analysis of Token Selection in Multi-Head Attention | Tomesphere