Special Properties of Gradient Descent with Large Learning Rates