Interval Markov Decision Processes with Continuous Action-Spaces
Giannis Delimpaltadakis, Morteza Lahijanian, Manuel Mazo Jr., Luca Laurenti

TL;DR
This paper introduces continuous-action IMDPs (caIMDPs), extending traditional IMDPs to handle continuous action spaces, and develops efficient value iteration algorithms for control synthesis in uncertain stochastic systems.
Contribution
The paper proposes caIMDPs with action-dependent transition bounds and provides algorithms for their value iteration, enabling control synthesis with continuous actions.
Findings
Efficient value iteration algorithms for caIMDPs using linear or convex programming.
Synthesis over discrete actions at vertices of the action polytope can be optimal.
Demonstrated methods on a numerical example.
Abstract
Interval Markov Decision Processes (IMDPs) are finite-state uncertain Markov models, where the transition probabilities belong to intervals. Recently, there has been a surge of research on employing IMDPs as abstractions of stochastic systems for control synthesis. However, due to the absence of algorithms for synthesis over IMDPs with continuous action-spaces, the action-space is assumed discrete a-priori, which is a restrictive assumption for many applications. Motivated by this, we introduce continuous-action IMDPs (caIMDPs), where the bounds on transition probabilities are functions of the action variables, and study value iteration for maximizing expected cumulative rewards. Specifically, we decompose the max-min problem associated to value iteration to max problems, where is the number of states of the caIMDP. Then, exploiting the simple form of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Petri Nets in System Modeling · Flexible and Reconfigurable Manufacturing Systems
