Loading paper
Increasing Both Batch Size and Learning Rate Accelerates Stochastic Gradient Descent | Tomesphere