Loading paper
Large Spikes in Stochastic Gradient Descent: A Large-Deviations View | Tomesphere