Exponential Concentration in Stochastic Approximation

Kody Law; Neil Walton; Shangda Yang

arXiv:2208.07243·stat.ML·March 26, 2024

Exponential Concentration in Stochastic Approximation

Kody Law, Neil Walton, Shangda Yang

PDF

Open Access

TL;DR

This paper establishes exponential concentration bounds for stochastic approximation algorithms, providing a new perspective that complements traditional asymptotic normality results, and applies to several algorithms including stochastic gradient descent.

Contribution

It extends geometric ergodicity techniques to stochastic approximation, deriving exponential tail bounds and convergence rates for multiple algorithms.

Findings

01

Proves exponential concentration bounds for stochastic approximation.

02

Demonstrates linear and faster convergence rates for specific algorithms.

03

Extends Markov chain ergodicity results to stochastic approximation context.

Abstract

We analyze the behavior of stochastic approximation algorithms where iterates, in expectation, progress towards an objective at each step. When progress is proportional to the step size of the algorithm, we prove exponential concentration bounds. These tail-bounds contrast asymptotic normality results, which are more frequently associated with stochastic approximation. The methods that we develop rely on a geometric ergodicity proof. This extends a result on Markov chains due to Hajek (1982) to the area of stochastic approximation algorithms. We apply our results to several different Stochastic Approximation algorithms, specifically Projected Stochastic Gradient Descent, Kiefer-Wolfowitz and Stochastic Frank-Wolfe algorithms. When applicable, our results prove faster $O (1/ t)$ and linear convergence rates for Projected Stochastic Gradient Descent with a non-vanishing gradient.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiative Heat Transfer Studies · Advanced Optimization Algorithms Research