Finite Optimal Control for Time-Bounded Reachability in CTMDPs and   Continuous-Time Markov Games

Markus Rabe; Sven Schewe

arXiv:1004.4005·cs.FL·June 7, 2010·4 cites

Finite Optimal Control for Time-Bounded Reachability in CTMDPs and Continuous-Time Markov Games

Markus Rabe, Sven Schewe

PDF

Open Access

TL;DR

This paper proves the existence of finite, deterministic, and time-structured optimal control strategies for time-bounded reachability in continuous-time Markov decision processes and games, simplifying their analysis.

Contribution

It establishes the existence of finite, deterministic, and time-partitioned optimal strategies for CTMDPs and extends these results to continuous-time Markov games.

Findings

01

Optimal strategies are deterministic and timed-positional.

02

Time can be divided into finite intervals with simple strategies.

03

Properties extend from CTMDPs to Markov games.

Abstract

We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-time Markov games. Furthermore, we show that optimal control does not only exist, but has a surprisingly simple structure: The optimal schedulers from our proofs are deterministic and timed-positional, and the bounded time can be divided into a finite number of intervals, in which the optimal strategies are positional. That is, we demonstrate the existence of finite optimal control. Finally, we show that these pleasant properties of Markov decision processes extend to the more general class of continuous-time Markov games, and that both early and late schedulers show this behaviour.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Real-Time Systems Scheduling · Formal Methods in Verification