Metrics for Markov Decision Processes with Infinite State Spaces

Norman Ferns; Prakash Panangaden; Doina Precup

arXiv:1207.1386·cs.AI·July 9, 2012·46 cites

Metrics for Markov Decision Processes with Infinite State Spaces

Norman Ferns, Prakash Panangaden, Doina Precup

PDF

Open Access

TL;DR

This paper introduces metrics for quantifying state similarity in infinite-state Markov decision processes, enabling stable approximations and demonstrating the continuity of the value function with respect to these metrics.

Contribution

It proposes a new class of metrics for infinite-state MDPs that generalize bisimulation and support approximation methods.

Findings

01

Value functions are continuous under the proposed metrics.

02

Metrics extend bisimulation to infinite and continuous state spaces.

03

Facilitates stable MDP approximations.

Abstract

We present metrics for measuring state similarity in Markov decision processes (MDPs) with infinitely many states, including MDPs with continuous state spaces. Such metrics provide a stable quantitative analogue of the notion of bisimulation for MDPs, and are suitable for use in MDP approximation. We show that the optimal value function associated with a discounted infinite horizon planning task varies continuously with respect to our metric distances.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFormal Methods in Verification · Bayesian Modeling and Causal Inference · Software Reliability and Analysis Research