Loading paper
SGD with a Constant Large Learning Rate Can Converge to Local Maxima | Tomesphere