Loading paper
WNGrad: Learn the Learning Rate in Gradient Descent | Tomesphere