Finite-sum optimization: Adaptivity to smoothness and loopless variance reduction
Bastien Batardi\`ere, Joon Kwon

TL;DR
This paper introduces AdaLVR, a new adaptive variance-reduced gradient method that combines AdaGrad with loopless variance reduction techniques, achieving faster convergence without prior knowledge of smoothness parameters.
Contribution
AdaLVR integrates AdaGrad with loopless variance reduction, providing an adaptive, efficient optimization method with proven convergence guarantees for smooth convex functions.
Findings
AdaLVR achieves optimal gradient complexity without prior smoothness knowledge.
Numerical experiments show AdaLVR outperforms existing methods.
Combining RMSprop or Adam with variance reduction accelerates convergence.
Abstract
For finite-sum optimization, variance-reduced gradient methods (VR) compute at each iteration the gradient of a single function (or of a mini-batch), and yet achieve faster convergence than SGD thanks to a carefully crafted lower-variance stochastic gradient estimator that reuses past gradients. Another important line of research of the past decade in continuous optimization is the adaptive algorithms such as AdaGrad, that dynamically adjust the (possibly coordinate-wise) learning rate to past gradients and thereby adapt to the geometry of the objective function. Variants such as RMSprop and Adam demonstrate outstanding practical performance that have contributed to the success of deep learning. In this work, we present AdaLVR, which combines the AdaGrad algorithm with loopless variance-reduced gradient estimators such as SAGA or L-SVRG that benefits from a straightforward construction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
