From SGD to Spectra: A Theory of Neural Network Weight Dynamics