A Generalized Version of Chung's Lemma and its Applications

Li Jiang; Xiao Li; Andre Milzarek; Junwen Qiu

arXiv:2406.05637·math.OC·February 11, 2026

A Generalized Version of Chung's Lemma and its Applications

Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

PDF

Open Access

TL;DR

This paper introduces a generalized Chung's Lemma that provides a unified, non-asymptotic framework for analyzing convergence rates of stochastic optimization methods under various step size strategies and conditions.

Contribution

It extends Chung's Lemma to a broader class of step sizes, enabling tight convergence analysis for stochastic methods under general conditions, including the -Polyak-Lojasiewicz condition.

Findings

01

Exponential step sizes adaptively achieve optimal convergence rates.

02

New non-asymptotic complexity results for SGD and RR under general conditions.

03

The generalized lemma simplifies convergence proofs across diverse step size rules.

Abstract

Chung's Lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's Lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad applicability of the proposed generalized lemma by deriving tight non-asymptotic convergence rates for a large variety of stochastic methods. In particular, we obtain partially new non-asymptotic complexity results for stochastic optimization methods, such as Stochastic Gradient Descent (SGD) and Random Reshuffling (RR), under a general $(θ, μ)$ -Polyak-Lojasiewicz (PL) condition and for various step sizes strategies, including polynomial, constant, exponential, and cosine step sizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph theory and applications