Loading paper
LionMuon: Alternating Spectral and Sign Descent for Efficient Training | Tomesphere