Loading paper
SGD for Structured Nonconvex Functions: Learning Rates, Minibatching and Interpolation | Tomesphere