On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration

Wenlong Mou; Chris Junchi Li; Martin J. Wainwright; Peter L. Bartlett; Michael I. Jordan

arXiv:2004.04719 · stat.ML · April 10, 2020 · 21 cites

Open Access

TL;DR

This paper provides a detailed analysis of linear stochastic approximation with Polyak-Ruppert averaging, establishing precise asymptotic distributions and non-asymptotic concentration bounds, with applications to reinforcement learning and optimization.

Contribution

It offers the first comprehensive CLT and concentration inequalities for linear stochastic approximation under general spectral conditions, including non-Hurwitz matrices.

Findings

01

CLT characterizes asymptotic covariance with correction term

02

Non-asymptotic bounds match CLT covariance up to constants

03

Achieves O(1/T) mean-squared error rate for non-Hurwitz matrices

Abstract

We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system $\bar{A} \theta = \bar{b}$. When the matrix $\bar{A}$ is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asymptotic covariance matrix, which is the sum of the classical Polyak-Ruppert covariance and a correction term that scales with the step size. Under assumptions on the tail of the noise distribution, we prove a non-asymptotic concentration inequality whose main term matches the covariance in CLT in any direction, up to universal constants. When the matrix $\bar{A}$ is not Hurwitz but only has non-negative real parts in its eigenvalues, we prove that the averaged LSA procedure actually…
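As a rough illustration of the procedure the abstract describes, the following is a minimal sketch of linear stochastic approximation (LSA) with a fixed step size and Polyak-Ruppert averaging. It is not taken from the paper: the update rule $\theta_{t+1} = \theta_t - \eta (A_t \theta_t - b_t)$, the i.i.d. Gaussian noise model, the sign convention (eigenvalues of $\bar{A}$ with positive real parts), and all names such as `lsa_polyak_ruppert`, `eta`, and `noise_std` are assumptions chosen for the example.

```python
import random


def lsa_polyak_ruppert(A_bar, b_bar, eta, T, noise_std=0.1, seed=0):
    """Run T steps of LSA with fixed step size eta and a running average.

    Assumed (hypothetical) setup: at each step we observe A_bar and b_bar
    corrupted by independent Gaussian noise with standard deviation noise_std.
    Returns (last_iterate, averaged_iterate).
    """
    rng = random.Random(seed)
    d = len(b_bar)
    theta = [0.0] * d      # current iterate, started at the origin
    theta_avg = [0.0] * d  # Polyak-Ruppert average of the iterates so far

    for t in range(1, T + 1):
        # Noisy observations of the matrix and the right-hand side.
        A_t = [[A_bar[i][j] + noise_std * rng.gauss(0.0, 1.0) for j in range(d)]
               for i in range(d)]
        b_t = [b_bar[i] + noise_std * rng.gauss(0.0, 1.0) for i in range(d)]

        # Stochastic residual A_t theta - b_t, then a fixed-step update.
        resid = [sum(A_t[i][j] * theta[j] for j in range(d)) - b_t[i]
                 for i in range(d)]
        theta = [theta[i] - eta * resid[i] for i in range(d)]

        # Incremental mean: avg_t = avg_{t-1} + (theta_t - avg_{t-1}) / t.
        theta_avg = [theta_avg[i] + (theta[i] - theta_avg[i]) / t
                     for i in range(d)]

    return theta, theta_avg


# Illustrative 2x2 system; eigenvalues of A_BAR are 1 +/- 0.5i.
A_BAR = [[1.0, 0.5], [-0.5, 1.0]]
B_BAR = [1.0, 0.0]
THETA_STAR = [0.8, 0.4]  # exact solution of A_BAR theta = B_BAR

last, avg = lsa_polyak_ruppert(A_BAR, B_BAR, eta=0.05, T=20000)
err_last = max(abs(last[i] - THETA_STAR[i]) for i in range(2))
err_avg = max(abs(avg[i] - THETA_STAR[i]) for i in range(2))
print("last-iterate error:", err_last)
print("averaged error:   ", err_avg)
```

With a fixed step size the last iterate keeps fluctuating around the solution, while the running average smooths those fluctuations out; the averaged iterate is the quantity whose limiting distribution and concentration the abstract refers to.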

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Random Matrices and Applications · Markov Chains and Monte Carlo Methods