A Fully Polynomial Time Approximation Scheme for Constrained MDPs and   Stochastic Shortest Path under Local Transitions

Majid Khonji

arXiv:2204.04780·cs.AI·April 19, 2023

A Fully Polynomial Time Approximation Scheme for Constrained MDPs and Stochastic Shortest Path under Local Transitions

Majid Khonji

PDF

Open Access

TL;DR

This paper introduces a fully polynomial-time approximation scheme for constrained Markov Decision Processes with local transitions, enabling near-optimal planning in complex stochastic environments with safety constraints.

Contribution

It presents the first efficient approximation algorithm for (C)C-MDPs with local transitions, addressing NP-hardness and providing practical policy computation methods.

Findings

01

The algorithm achieves near-optimal policies within polynomial time.

02

Local transition structure simplifies the complexity of constrained MDPs.

03

The approach offers theoretical insights into the approximability of constrained stochastic planning.

Abstract

The fixed-horizon constrained Markov Decision Process (C-MDP) is a well-known model for planning in stochastic environments under operating constraints. Chance-Constrained MDP (CC-MDP) is a variant that allows bounding the probability of constraint violation, which is desired in many safety-critical applications. CC-MDP can also model a class of MDPs, called Stochastic Shortest Path (SSP), under dead-ends, where there is a trade-off between the probability-to-goal and cost-to-goal. This work studies the structure of (C)C-MDP, particularly an important variant that involves local transition. In this variant, the state reachability exhibits a certain degree of locality and independence from the remaining states. More precisely, the number of states, at a given time, that share some reachable future states is always constant. (C)C-MDP under local transition is NP-Hard even for a planning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Logic, Reasoning, and Knowledge