On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration
Wenlong Mou, Chris Junchi Li, Martin J. Wainwright, Peter L. Bartlett,, Michael I. Jordan

TL;DR
This paper provides a detailed analysis of linear stochastic approximation with Polyak-Ruppert averaging, establishing precise asymptotic distributions and non-asymptotic concentration bounds, with applications to reinforcement learning and optimization.
Contribution
It offers the first comprehensive CLT and concentration inequalities for linear stochastic approximation under general spectral conditions, including non-Hurwitz matrices.
Findings
CLT characterizes asymptotic covariance with correction term
Non-asymptotic bounds match CLT covariance up to constants
Achieves O(1/T) mean-squared error rate for non-Hurwitz matrices
Abstract
We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system . When the matrix is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asymptotic covariance matrix, which is the sum of the classical Polyak-Ruppert covariance and a correction term that scales with the step size. Under assumptions on the tail of the noise distribution, we prove a non-asymptotic concentration inequality whose main term matches the covariance in CLT in any direction, up to universal constants. When the matrix is not Hurwitz but only has non-negative real parts in its eigenvalues, we prove that the averaged LSA procedure actually…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Random Matrices and Applications · Markov Chains and Monte Carlo Methods
