diffGrad: An Optimization Method for Convolutional Neural Networks