Universal Gradient Methods for Stochastic Convex Optimization

Anton Rodomanov; Ali Kavis; Yongtao Wu; Kimon Antonakopoulos; Volkan; Cevher

arXiv:2402.03210·math.OC·July 12, 2024·ICML·1 cites

Universal Gradient Methods for Stochastic Convex Optimization

Anton Rodomanov, Ali Kavis, Yongtao Wu, Kimon Antonakopoulos, Volkan, Cevher

PDF

Open Access

TL;DR

This paper introduces universal gradient algorithms for stochastic convex optimization that adapt to noise and smoothness levels without prior knowledge, achieving state-of-the-art convergence guarantees.

Contribution

The paper presents novel universal gradient methods that automatically adapt to noise and smoothness in stochastic convex optimization, with improved convergence guarantees.

Findings

01

Achieves state-of-the-art worst-case convergence rates.

02

Adapts to H"older smoothness without prior knowledge.

03

Provides optimal efficiency estimates for the universal fast gradient method.

Abstract

We develop universal gradient methods for Stochastic Convex Optimization (SCO). Our algorithms automatically adapt not only to the oracle's noise but also to the H\"older smoothness of the objective function without a priori knowledge of the particular setting. The key ingredient is a novel strategy for adjusting step-size coefficients in the Stochastic Gradient Method (SGD). Unlike AdaGrad, which accumulates gradient norms, our Universal Gradient Method accumulates appropriate combinations of gradient- and iterate differences. The resulting algorithm has state-of-the-art worst-case convergence rate guarantees for the entire H\"older class including, in particular, both nonsmooth functions and those with Lipschitz continuous gradient. We also present the Universal Fast Gradient Method for SCO enjoying optimal efficiency estimates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference