Finite-Sample Wasserstein Error Bounds and Concentration Inequalities for Nonlinear Stochastic Approximation
Seo Taek Kong, R. Srikant

TL;DR
This paper provides explicit finite-sample error bounds and concentration inequalities for nonlinear stochastic approximation algorithms in Wasserstein distance, applicable to various noise conditions, and analyzes their distributional convergence rates.
Contribution
It introduces a coupling method for finite-sample Wasserstein bounds and extends distributional convergence analysis to general noise settings, including Markov chains.
Findings
Normalized last iterates converge to Gaussian at rate γ_n^{1/6}
Polyak-Ruppert average converges at rate n^{-1/6}
Results apply to stochastic gradient descent and linear stochastic approximation
Abstract
This paper derives non-asymptotic error bounds for nonlinear stochastic approximation algorithms in the Wasserstein- distance. To obtain explicit finite-sample guarantees for the last iterate, we develop a coupling argument that compares the discrete-time process to a limiting Ornstein-Uhlenbeck process. Our analysis applies to algorithms driven by general noise conditions, including martingale differences and functions of ergodic Markov chains. Complementing this result, we handle the convergence rate of the Polyak-Ruppert average through a direct analysis that applies under the same general setting. Assuming the driving noise satisfies a non-asymptotic central limit theorem, we show that the normalized last iterates converge to a Gaussian distribution in the -Wasserstein distance at a rate of order , where is the step size. Similarly, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Geometric Analysis and Curvature Flows
