A Fully Polynomial Time Approximation Scheme for Constrained MDPs and Stochastic Shortest Path under Local Transitions
Majid Khonji

TL;DR
This paper introduces a fully polynomial-time approximation scheme for constrained Markov Decision Processes with local transitions, enabling near-optimal planning in complex stochastic environments with safety constraints.
Contribution
It presents the first efficient approximation algorithm for (C)C-MDPs with local transitions, addressing NP-hardness and providing practical policy computation methods.
Findings
The algorithm achieves near-optimal policies within polynomial time.
Local transition structure simplifies the complexity of constrained MDPs.
The approach offers theoretical insights into the approximability of constrained stochastic planning.
Abstract
The fixed-horizon constrained Markov Decision Process (C-MDP) is a well-known model for planning in stochastic environments under operating constraints. Chance-Constrained MDP (CC-MDP) is a variant that allows bounding the probability of constraint violation, which is desired in many safety-critical applications. CC-MDP can also model a class of MDPs, called Stochastic Shortest Path (SSP), under dead-ends, where there is a trade-off between the probability-to-goal and cost-to-goal. This work studies the structure of (C)C-MDP, particularly an important variant that involves local transition. In this variant, the state reachability exhibits a certain degree of locality and independence from the remaining states. More precisely, the number of states, at a given time, that share some reachable future states is always constant. (C)C-MDP under local transition is NP-Hard even for a planning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Logic, Reasoning, and Knowledge
