Loading paper
Nexusformer: Nonlinear Attention Expansion for Stable and Inheritable Transformer Scaling | Tomesphere