Loading paper
Overshoot: Taking advantage of future gradients in momentum-based stochastic optimization | Tomesphere