Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data

Xiang Li; Jiadong Liang; Zhihua Zhang

arXiv:2302.07690 · math.ST · February 21, 2023 · 1 citation

Open Access

TL;DR

This paper develops a statistical inference framework for nonlinear stochastic approximation algorithms using Markovian data, establishing a central limit theorem and confidence intervals applicable to methods like SGD and Q-learning.

Contribution

It introduces a functional central limit theorem and an inference method for nonlinear stochastic approximation with Markovian data, including practical confidence interval construction.

Findings

01. Established a functional central limit theorem for the partial-sum process

02. Provided a semiparametric efficient lower bound and non-asymptotic bounds

03. Validated the method's effectiveness through simulations

Abstract

We study the statistical inference of nonlinear stochastic approximation algorithms utilizing a single trajectory of Markovian data. Our methodology has practical applications in various scenarios, such as Stochastic Gradient Descent (SGD) on autoregressive data and asynchronous Q-Learning. By utilizing the standard stochastic approximation (SA) framework to estimate the target parameter, we establish a functional central limit theorem for its partial-sum process, $\phi_T$. To further support this theory, we provide a matching semiparametric efficient lower bound and a non-asymptotic upper bound on its weak convergence, measured in the Lévy-Prokhorov metric. This functional central limit theorem forms the basis for our inference method. By selecting any continuous scale-invariant functional $f$, the asymptotic pivotal statistic $f(\phi_T)$ becomes accessible,…
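As a rough illustration of the idea in the abstract, the sketch below runs a scalar SA recursion on a single AR(1) trajectory and builds a confidence interval from the partial sums of the iterates. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy recursion (linear mean estimation rather than a nonlinear problem), the step size $\eta_t = t^{-0.51}$, the function names, and the choice of functional. The functional is the "random scaling" ratio $f(\phi) = \phi(1) / \big(\int_0^1 (\phi(r) - r\phi(1))^2 \, dr\big)^{1/2}$, which is unchanged when $\phi$ is multiplied by a positive constant. The critical value 6.747 is the commonly tabulated 97.5% quantile of that ratio for a standard Brownian motion and is treated as an assumption.

```python
import math
import random


def ar1_stream(theta_star, rho, n, rng):
    """Yield n samples of an AR(1) chain whose stationary mean is theta_star."""
    z = 0.0
    for _ in range(n):
        z = rho * z + rng.gauss(0.0, 1.0)
        yield theta_star + z


def sa_iterates(stream, x0=0.0, alpha=0.51):
    """Toy SA recursion x_{t+1} = x_t - eta_t * (x_t - xi_t), eta_t = t^{-alpha}."""
    x, out = x0, []
    for t, xi in enumerate(stream, start=1):
        x -= t ** (-alpha) * (x - xi)
        out.append(x)
    return out


def random_scaling_interval(iterates, crit=6.747):
    """Interval x_bar +/- crit * sqrt(V / T) built from partial sums.

    V = T^{-2} * sum_s (S_s - s * x_bar)^2, where S_s is the s-th partial sum
    of the iterates. crit is an assumed tabulated quantile (hypothetical
    default for a nominal 95% two-sided interval).
    """
    T = len(iterates)
    x_bar = sum(iterates) / T
    running, v = 0.0, 0.0
    for s, x in enumerate(iterates, start=1):
        running += x
        v += (running - s * x_bar) ** 2
    v /= T ** 2
    half = crit * math.sqrt(v / T)
    return x_bar - half, x_bar + half


if __name__ == "__main__":
    rng = random.Random(0)
    xs = sa_iterates(ar1_stream(theta_star=1.0, rho=0.5, n=20000, rng=rng))
    print(random_scaling_interval(xs))
```

No long-run variance estimate appears anywhere in the computation: the denominator is formed from the same partial sums as the numerator, so the unknown scale cancels in the ratio. The running sums can also be updated one iterate at a time, which is what makes this kind of statistic usable online.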

Taxonomy

Topics: Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Statistical Methods and Inference

Methods: Q-Learning