Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data
Xiang Li, Jiadong Liang, Zhihua Zhang

TL;DR
This paper develops a statistical inference framework for nonlinear stochastic approximation algorithms using Markovian data, establishing a central limit theorem and confidence intervals applicable to methods like SGD and Q-learning.
Contribution
It introduces a functional central limit theorem and an inference method for nonlinear stochastic approximation with Markovian data, including practical confidence interval construction.
Findings
Established a functional central limit theorem for the partial-sum process
Provided a semiparametric efficient lower bound and non-asymptotic bounds
Validated the method's effectiveness through simulations
Abstract
We study the statistical inference of nonlinear stochastic approximation algorithms utilizing a single trajectory of Markovian data. Our methodology has practical applications in various scenarios, such as Stochastic Gradient Descent (SGD) on autoregressive data and asynchronous Q-Learning. By utilizing the standard stochastic approximation (SA) framework to estimate the target parameter, we establish a functional central limit theorem for its partial-sum process, . To further support this theory, we provide a matching semiparametric efficient lower bound and a non-asymptotic upper bound on its weak convergence, measured in the L\'evy-Prokhorov metric. This functional central limit theorem forms the basis for our inference method. By selecting any continuous scale-invariant functional , the asymptotic pivotal statistic becomes accessible,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Statistical Methods and Inference
MethodsQ-Learning
