Loading paper
u-$\mu$P: The Unit-Scaled Maximal Update Parametrization | Tomesphere