Multiple-Environment Markov Decision Processes

Jean-Fran\c{c}ois Raskin; Ocan Sankur

arXiv:1405.4733·cs.LO·December 4, 2014

Multiple-Environment Markov Decision Processes

Jean-Fran\c{c}ois Raskin, Ocan Sankur

PDF

TL;DR

This paper introduces Multi-Environment Markov Decision Processes (MEMDPs), enabling the synthesis of controllers that perform reliably across multiple environments, with some verification problems becoming decidable and more efficient compared to partially observable MDPs.

Contribution

The paper defines MEMDPs and demonstrates that certain verification problems are decidable and can be solved efficiently, unlike in partially observable MDPs.

Findings

01

Verification problems are decidable for MEMDPs.

02

Some problems have efficient solutions.

03

MEMDPs extend the applicability of MDPs in uncertain environments.

Abstract

We introduce Multi-Environment Markov Decision Processes (MEMDPs) which are MDPs with a set of probabilistic transition functions. The goal in a MEMDP is to synthesize a single controller with guaranteed performances against all environments even though the environment is unknown a priori. While MEMDPs can be seen as a special class of partially observable MDPs, we show that several verification problems that are undecidable for partially observable MDPs, are decidable for MEMDPs and sometimes have even efficient solutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.