Likelihood Ratio Gradient Estimation for Steady-State Parameters
Peter W. Glynn, Mariana Olvera-Cravioto

TL;DR
This paper develops likelihood ratio methods to estimate the gradient of steady-state expectations in parameterized Markov chains, providing conditions for differentiability and analyzing estimator behavior.
Contribution
It introduces two likelihood ratio estimators for steady-state gradient estimation and establishes their theoretical properties under geometric ergodicity.
Findings
Provided sufficient conditions for differentiability of steady-state expectations.
Proposed two likelihood ratio estimators for gradient estimation.
Analyzed the limiting behavior of the estimators.
Abstract
We consider a discrete-time Markov chain on a general state-space , whose transition probabilities are parameterized by a real-valued vector . Under the assumption that is geometrically ergodic with corresponding stationary distribution , we are interested in estimating the gradient of the steady-state expectation To this end, we first give sufficient conditions for the differentiability of and for the calculation of its gradient via a sequence of finite horizon expectations. We then propose two different likelihood ratio estimators and analyze their limiting behavior.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
