Quantifying Normality: Convergence Rate to Gaussian Limit for Stochastic Approximation and Unadjusted OU Algorithm
Shaan Ul Haque, Zedong Wang, Zixuan Zhang, Siva Theja Maguluri

TL;DR
This paper provides explicit finite-time bounds on how quickly stochastic approximation algorithms converge to their Gaussian limit, improving understanding of their accuracy in practical, finite-time scenarios.
Contribution
It introduces non-asymptotic Wasserstein distance bounds for SA iterates, connecting convergence rates to the discrete Ornstein-Uhlenbeck process and adapting Stein's method for matrix-weighted sums.
Findings
Established explicit convergence rates for SA to Gaussian limit.
Derived tail bounds for SA errors at any finite time.
Connected convergence analysis to sampling literature via Ornstein-Uhlenbeck process.
Abstract
Stochastic approximation (SA) is a method for finding the root of an operator perturbed by noise. There is a rich literature establishing the asymptotic normality of rescaled SA iterates under fairly mild conditions. However, these asymptotic results do not quantify the accuracy of the Gaussian approximation in finite time. In this paper, we establish explicit non-asymptotic bounds on the Wasserstein distance between the distribution of the rescaled iterate at time k and the asymptotic Gaussian limit for various choices of step-sizes including constant and polynomially decaying. As an immediate consequence, we obtain tail bounds on the error of SA iterates at any time. We obtain the sharp rates by first studying the convergence rate of the discrete Ornstein-Uhlenbeck (O-U) process driven by general noise, whose stationary distribution is identical to the limiting Gaussian distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Random Matrices and Applications · Markov Chains and Monte Carlo Methods
