Loading paper
On the Generalization Benefit of Noise in Stochastic Gradient Descent | Tomesphere