Loading paper
Routing Absorption in Sparse Attention: Why Random Gates Are Hard to Beat | Tomesphere