Towards Explaining the Regularization Effect of Initial Large Learning Rate in Training Neural Networks