Loading paper
Beyond Linearity in Attention Projections: The Case for Nonlinear Queries | Tomesphere