Convergence Analysis of Stochastic Accelerated Gradient Methods for   Generalized Smooth Optimizations

Chenhao Yu; Yusu Hong; Junhong Lin

arXiv:2502.11125·math.OC·February 25, 2025

Convergence Analysis of Stochastic Accelerated Gradient Methods for Generalized Smooth Optimizations

Chenhao Yu, Yusu Hong, Junhong Lin

PDF

Open Access

TL;DR

This paper analyzes the convergence rates of the RSAG method for stochastic optimization with generalized smooth functions, providing high-probability bounds under relaxed noise assumptions and extending results to SGD.

Contribution

It introduces convergence guarantees for RSAG with constant or adaptive step sizes under relaxed noise conditions, improving understanding of stochastic accelerated methods.

Findings

01

High-probability convergence rate of (\u007F(rac{\u221a{\u0131}(rac{1}{\u03b4})}{T})) for convex functions.

02

Improved convergence rate of (rac{\u221a{\u0131}(rac{1}{\u03b4})}{T}) when noise is small.

03

Applicability of analysis to SGD with various step size strategies.

Abstract

We investigate the Randomized Stochastic Accelerated Gradient (RSAG) method, utilizing either constant or adaptive step sizes, for stochastic optimization problems with generalized smooth objective functions. Under relaxed affine variance assumptions for the stochastic gradient noise, we establish high-probability convergence rates of order $\tilde{O} (lo g (1/ δ) / T)$ for function value gaps in the convex setting, and for the squared gradient norms in the non-convex setting. Furthermore, when the noise parameters are sufficiently small, the convergence rate improves to $\tilde{O} (lo g (1/ δ) / T)$ , where $T$ denotes the total number of iterations and $δ$ is the probability margin. Our analysis is also applicable to SGD with both constant and adaptive step sizes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques