Theory of Deep Learning IIb: Optimization Properties of SGD