A Note on the Bias and Kemeny's Constant in Markov Reward Processes with an Application to Markov Chain Perturbation

Ronald Ortner

arXiv:2408.04454 · math.PR · August 9, 2024

Open Access

TL;DR

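This paper derives an explicit expression for the bias values of a unichain Markov reward process (MRP) in terms of mean first passage times, generalizes known perturbation bounds for the stationary distribution to perturbed chains that are not irreducible, and interprets Kemeny's constant as the translated bias of an MRP with constant reward 1.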

Contribution

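The paper gives an explicit formula for the bias in unichain MRPs, extends known stationary-distribution perturbation bounds to the case where the perturbed chain is not irreducible, improves the perturbation bound in 1-norm, and explains why Kemeny's constant does not depend on the starting state.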

Findings

01 Explicit bias expression in terms of mean first passage times

02 Generalized perturbation bounds for non-irreducible chains

03 Kemeny's constant as translated bias in a constant reward MRP

Abstract

Given a unichain Markov reward process (MRP), we provide an explicit expression for the bias values in terms of mean first passage times. This result implies a generalization of known Markov chain perturbation bounds for the stationary distribution to the case where the perturbed chain is not irreducible. It further yields an improved perturbation bound in 1-norm. As a special case, Kemeny's constant can be interpreted as the translated bias in an MRP with constant reward 1, which offers an intuitive explanation why it is a constant.
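
The relationship between bias, mean first passage times, and Kemeny's constant can be checked numerically on a small example. The sketch below is an illustration only, not the paper's derivation: the 3-state transition matrix `P`, the reward vector `r`, and all helper names (`solve`, `stationary`, `mean_first_passage`, `bias`) are assumptions chosen for this example, and the chain is irreducible so that all mean first passage times are finite. It uses the convention m_jj = 0 and exact rational arithmetic. It checks two things that follow from standard fundamental-matrix identities: the sum Σ_j π_j m_ij is the same for every start state i (Kemeny's constant), and the bias h_i plus Σ_j π_j r_j m_ij is also the same for every i. The paper's own expression may be stated in a different form.

```python
from fractions import Fraction as F


def solve(A, b):
    """Solve A x = b exactly by Gauss-Jordan elimination (A square, nonsingular)."""
    n = len(A)
    M = [[F(x) for x in row] + [F(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next(row for row in range(col, n) if M[row][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [x / M[col][col] for x in M[col]]
        for row in range(n):
            if row != col and M[row][col] != 0:
                factor = M[row][col]
                M[row] = [x - factor * y for x, y in zip(M[row], M[col])]
    return [M[row][n] for row in range(n)]


def stationary(P):
    """Stationary distribution: pi (I - P) = 0 with the entries of pi summing to 1."""
    n = len(P)
    A = [[(1 if i == j else 0) - P[j][i] for j in range(n)] for i in range(n)]
    A[-1] = [1] * n  # replace one redundant equation by the normalization
    return solve(A, [0] * (n - 1) + [1])


def mean_first_passage(P):
    """m[i][j] = expected number of steps to first reach j from i, with m[j][j] = 0."""
    n = len(P)
    m = [[F(0)] * n for _ in range(n)]
    for j in range(n):
        others = [i for i in range(n) if i != j]
        A = [[(1 if i == k else 0) - P[i][k] for k in others] for i in others]
        for i, value in zip(others, solve(A, [1] * len(others))):
            m[i][j] = value
    return m


def bias(P, r, pi):
    """Bias h solving (I - P + 1 pi^T) h = r - rho 1, which also gives pi . h = 0."""
    n = len(P)
    rho = sum(p * x for p, x in zip(pi, r))  # average reward
    A = [[(1 if i == j else 0) - P[i][j] + pi[j] for j in range(n)] for i in range(n)]
    return solve(A, [x - rho for x in r])


# Hypothetical example chain (a lazy random walk on three states) and reward.
P = [
    [F(1, 2), F(1, 2), F(0)],
    [F(1, 4), F(1, 2), F(1, 4)],
    [F(0), F(1, 2), F(1, 2)],
]
r = [F(1), F(0), F(0)]

pi = stationary(P)
m = mean_first_passage(P)
h = bias(P, r, pi)
states = range(len(P))

# Kemeny's constant computed from each start state i.
kemeny = [sum(pi[j] * m[i][j] for j in states) for i in states]
# Bias shifted by the reward-weighted mean first passage times.
shifted = [h[i] + sum(pi[j] * r[j] * m[i][j] for j in states) for i in states]

print("stationary distribution:", pi)
print("Kemeny's constant per start state:", kemeny)
print("bias:", h)
print("bias + weighted passage times:", shifted)
```

For this chain both lists are constant across start states (3 for the Kemeny sums and 5/4 for the shifted bias). With the constant reward r = (1, 1, 1) the bias is zero, so the shifted quantity reduces to Σ_j π_j m_ij, which matches the reading of Kemeny's constant as a translated bias.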

Taxonomy

Topics: Probability and Risk Models · Simulation Techniques and Applications