Power of Generalized Smoothness in Stochastic Convex Optimization: First- and Zero-Order Algorithms

Aleksandr Lobanov; Alexander Gasnikov

arXiv:2501.18198·math.OC·May 26, 2025

Power of Generalized Smoothness in Stochastic Convex Optimization: First- and Zero-Order Algorithms

Aleksandr Lobanov, Alexander Gasnikov

PDF

Open Access

TL;DR

This paper explores stochastic convex optimization under generalized smoothness, introducing new algorithms and convergence results for both first- and zero-order methods, with practical implications demonstrated through experiments.

Contribution

It extends convergence analysis to generalized smoothness, including biased oracles and zero-order algorithms, providing new complexity bounds and demonstrating linear convergence.

Findings

01

Derived iteration complexity bounds for stochastic gradient methods under generalized smoothness.

02

Extended convergence results to biased gradient oracles and zero-order algorithms.

03

Numerical experiments show linear convergence in convex stochastic optimization.

Abstract

This paper is devoted to the study of stochastic optimization problems under the generalized smoothness assumption. By considering the unbiased gradient oracle in Stochastic Gradient Descent, we provide strategies to achieve in bounds the summands describing linear rate. In particular, in the case $L_{0} = 0$ , we obtain in the convex setup the iteration complexity: $N = O (L_{1} R lo g \frac{1}{ε} + \frac{L _{1} c R ^{2}}{ε})$ for Clipped Stochastic Gradient Descent and $N = O (L_{1} R lo g \frac{1}{ε})$ for Normalized Stochastic Gradient Descent. Furthermore, we generalize the convergence results to the case with a biased gradient oracle, and show that the power of $(L_{0}, L_{1})$ -smoothness extends to zero-order algorithms. Finally, we demonstrate the possibility of linear convergence in the convex setup through numerical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques