Power of Generalized Smoothness in Stochastic Convex Optimization: First- and Zero-Order Algorithms
Aleksandr Lobanov, Alexander Gasnikov

TL;DR
This paper explores stochastic convex optimization under generalized smoothness, introducing new algorithms and convergence results for both first- and zero-order methods, with practical implications demonstrated through experiments.
Contribution
It extends convergence analysis to generalized smoothness, including biased oracles and zero-order algorithms, providing new complexity bounds and demonstrating linear convergence.
Findings
Derived iteration complexity bounds for stochastic gradient methods under generalized smoothness.
Extended convergence results to biased gradient oracles and zero-order algorithms.
Numerical experiments show linear convergence in convex stochastic optimization.
Abstract
This paper is devoted to the study of stochastic optimization problems under the generalized smoothness assumption. By considering the unbiased gradient oracle in Stochastic Gradient Descent, we provide strategies to achieve in bounds the summands describing linear rate. In particular, in the case , we obtain in the convex setup the iteration complexity: for Clipped Stochastic Gradient Descent and for Normalized Stochastic Gradient Descent. Furthermore, we generalize the convergence results to the case with a biased gradient oracle, and show that the power of -smoothness extends to zero-order algorithms. Finally, we demonstrate the possibility of linear convergence in the convex setup through numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
