Likelihood Ratio Gradient Estimation for Steady-State Parameters

Peter W. Glynn; Mariana Olvera-Cravioto

arXiv:1707.02659·math.PR·March 12, 2018

TL;DR

This paper develops likelihood ratio methods to estimate the gradient of steady-state expectations in parameterized Markov chains, providing conditions for differentiability and analyzing estimator behavior.

Contribution

The paper introduces two likelihood ratio estimators for the gradient of a steady-state expectation and analyzes their limiting behavior under geometric ergodicity.

Findings

01

Provided sufficient conditions for differentiability of steady-state expectations.

02

Proposed two likelihood ratio estimators for gradient estimation.

03

Analyzed the limiting behavior of the estimators.

Abstract

We consider a discrete-time Markov chain $\Phi$ on a general state space $X$, whose transition probabilities are parameterized by a real-valued vector $\theta$. Under the assumption that $\Phi$ is geometrically ergodic with corresponding stationary distribution $\pi(\theta)$, we are interested in estimating the gradient $\nabla \alpha(\theta)$ of the steady-state expectation $\alpha(\theta) = \pi(\theta) f$. To this end, we first give sufficient conditions for the differentiability of $\alpha(\theta)$ and for the calculation of its gradient via a sequence of finite horizon expectations. We then propose two different likelihood ratio estimators and analyze their limiting behavior.
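To make the idea of approximating the steady-state gradient by finite horizon expectations concrete, here is a minimal sketch on a toy two-state chain. It is not one of the paper's two estimators. It uses the standard score-function identity $\nabla_\theta E_\theta[f(\Phi_n)] = E_\theta\big[f(\Phi_n) \sum_{k=1}^{n} \nabla_\theta \log p_\theta(\Phi_{k-1}, \Phi_k)\big]$, which holds when the initial state does not depend on $\theta$. The chain, the parameter `q`, the choice $f(x) = x$, and the function names are all assumptions made for illustration: from state 0 the chain moves to state 1 with probability $\theta$, and from state 1 it returns to state 0 with probability `q`, so that $\alpha(\theta) = \theta / (\theta + q)$ in closed form.

```python
import random


def lr_gradient_estimate(theta, q=0.5, horizon=12, n_paths=100_000, seed=0):
    """Monte Carlo estimate of d/dtheta E[f(X_horizon)] with f(x) = x.

    Hypothetical toy chain on {0, 1}:
      from 0: go to 1 w.p. theta, stay w.p. 1 - theta
      from 1: go to 0 w.p. q,     stay w.p. 1 - q
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_paths):
        state, score = 0, 0.0  # fixed initial state, so it carries no score
        for _ in range(horizon):
            if state == 0:
                if rng.random() < theta:
                    state = 1
                    score += 1.0 / theta           # d/dtheta log(theta)
                else:
                    score -= 1.0 / (1.0 - theta)   # d/dtheta log(1 - theta)
            elif rng.random() < q:
                state = 0  # this row does not depend on theta: zero score
        total += state * score  # f(X_n) times the accumulated score
    return total / n_paths


def exact_gradient(theta, q=0.5):
    """Closed-form derivative of alpha(theta) = theta / (theta + q)."""
    return q / (theta + q) ** 2


if __name__ == "__main__":
    print(lr_gradient_estimate(0.3), exact_gradient(0.3))
```

In this toy chain the second eigenvalue of the transition matrix is $1 - \theta - q$, so the finite-horizon bias decays geometrically in the horizon, while the variance of the accumulated score grows roughly linearly with it. That trade-off is one reason the limiting behavior of such estimators needs careful analysis.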
