Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction
Anton Rodomanov, Xiaowen Jiang, Sebastian Stich

TL;DR
This paper develops adaptive AdaGrad-based stochastic optimization methods that are effective for a broad class of convex problems, including inexact and H"older smooth cases, achieving optimal convergence without prior knowledge of problem constants.
Contribution
It introduces universal adaptive gradient methods with variance reduction and acceleration for inexact, H"older smooth convex optimization, achieving optimal convergence rates without prior parameter knowledge.
Findings
Proved efficiency of AdaGrad stepsizes in bounded variance settings.
Demonstrated implicit variance reduction properties under refined assumptions.
Integrated SVRG-type variance reduction for faster convergence.
Abstract
We present adaptive gradient methods (both basic and accelerated) for solving convex composite optimization problems in which the main part is approximately smooth (a.k.a. -smooth) and can be accessed only via a (potentially biased) stochastic gradient oracle. This setting covers many interesting examples including H\"older smooth problems and various inexact computations of the stochastic gradient. Our methods use AdaGrad stepsizes and are adaptive in the sense that they do not require knowing any problem-dependent constants except an estimate of the diameter of the feasible set but nevertheless achieve the best possible convergence rates as if they knew the corresponding constants. We demonstrate that AdaGrad stepsizes work in a variety of situations by proving, in a unified manner, three types of new results. First, we establish efficiency guarantees for our methods in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Simulation Techniques and Applications · Advanced Database Systems and Queries
