Markov Chain Variance Estimation: A Stochastic Approximation Approach

Shubhada Agrawal; Prashanth L.A.; Siva Theja Maguluri

arXiv:2409.05733·math.ST·September 24, 2024

Markov Chain Variance Estimation: A Stochastic Approximation Approach

Shubhada Agrawal, Prashanth L.A., Siva Theja Maguluri

PDF

Open Access

TL;DR

This paper introduces a recursive, efficient estimator for the asymptotic variance of functions on Markov chains, with applications in reinforcement learning and statistical inference, offering finite-sample guarantees and broad generalizations.

Contribution

A novel recursive estimator for asymptotic variance that is computationally efficient, does not require historical data, and is applicable to large state spaces and vector-valued functions.

Findings

01

Achieves optimal $O(1/n)$ MSE convergence rate.

02

Provides finite-sample guarantees for the estimator.

03

Demonstrates applications in risk-sensitive reinforcement learning.

Abstract

We consider the problem of estimating the asymptotic variance of a function defined on a Markov chain, an important step for statistical inference of the stationary mean. We design a novel recursive estimator that requires $O (1)$ computation at each step, does not require storing any historical samples or any prior knowledge of run-length, and has optimal $O (\frac{1}{n})$ rate of convergence for the mean-squared error (MSE) with provable finite sample guarantees. Here, $n$ refers to the total number of samples generated. Our estimator is based on linear stochastic approximation of an equivalent formulation of the asymptotic variance in terms of the solution of the Poisson equation. We generalize our estimator in several directions, including estimating the covariance matrix for vector-valued functions, estimating the stationary variance of a Markov chain, and approximately estimating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSimulation Techniques and Applications