Convex Approximations of Random Constrained Markov Decision Processes

V Varagapriya; Vikas Vikram Singh; Abdel Lisser

arXiv:2505.24815·math.OC·June 2, 2025

TL;DR

This paper develops convex approximations for joint chance-constrained Markov decision processes (JCCMDPs) in which both the running costs and the transition probabilities are random. It derives upper and lower bounds and validates them numerically.

Contribution

The paper introduces convex upper and lower bounds for uncertain CMDPs with random running costs and transition probabilities, extending the linear programming approach for exactly known CMDPs to stochastic settings.

Findings

01 Convex upper and lower bounds approximate the uncertain CMDP.

02 The quality of the bounds is validated through queueing and Garnet MDP experiments.

03 Dependence among the random constraint vectors is modeled with a Gumbel-Hougaard copula.
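
For reference, the Gumbel-Hougaard copula named in the third finding has a standard closed form, written out below. The symbols $K$ (number of margins), $u_k$, and $\theta$ are notation chosen here for illustration and are not taken from the paper.

```latex
% Gumbel-Hougaard copula with dependence parameter \theta \ge 1
C_\theta(u_1, \dots, u_K)
  = \exp\!\left( - \Big[ \sum_{k=1}^{K} \big( -\ln u_k \big)^{\theta} \Big]^{1/\theta} \right),
  \qquad u_k \in (0, 1], \quad \theta \ge 1 .

% Special case \theta = 1: the margins are independent
C_1(u_1, \dots, u_K) = \prod_{k=1}^{K} u_k .
```

Larger values of $\theta$ correspond to stronger positive dependence among the margins, while $\theta = 1$ recovers independence.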

Abstract

Constrained Markov decision processes (CMDPs) are used as a decision-making framework to study the long-run performance of a stochastic system. It is well known that a stationary optimal policy of a CMDP problem under the discounted cost criterion can be obtained by solving a linear programming problem when the running costs and transition probabilities are exactly known. In this paper, we consider a discounted cost CMDP problem where the running costs and transition probabilities are defined using random variables. Consequently, both the objective function and the constraints become random. We use chance constraints to model these uncertainties and formulate the uncertain CMDP problem as a joint chance-constrained Markov decision process (JCCMDP). Under random running costs, we assume that the dependency among random constraint vectors is driven by a Gumbel-Hougaard copula. Using standard…
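
The deterministic baseline mentioned in the abstract, where a stationary optimal policy is obtained from a linear program, can be sketched with the standard occupancy-measure formulation. This is a minimal illustration under stated assumptions, not the paper's code: the function name `solve_cmdp_lp`, the array layout, the $(1-\alpha)$ normalization of the occupancy measure, and the toy numbers are all choices made here, and the solver is SciPy's `linprog`.

```python
import numpy as np
from scipy.optimize import linprog


def solve_cmdp_lp(P, c, d, xi, gamma, alpha):
    """Solve a discounted CMDP with known data via its occupancy-measure LP.

    P     : (S, A, S) array, P[s, a, s2] = Pr(next state s2 | state s, action a)
    c     : (S, A) running cost in the objective
    d     : (K, S, A) running costs in the K constraints
    xi    : (K,) upper bounds on the constraint costs
    gamma : (S,) initial state distribution
    alpha : discount factor in (0, 1)
    """
    S, A = c.shape
    # Flow-balance equalities, one row per state s2:
    #   sum_a rho(s2, a) - alpha * sum_{s, a} P(s2 | s, a) rho(s, a) = (1 - alpha) * gamma(s2)
    A_eq = np.zeros((S, S * A))
    for s in range(S):
        for a in range(A):
            col = s * A + a          # matches the C-order flattening used below
            A_eq[s, col] += 1.0
            A_eq[:, col] -= alpha * P[s, a, :]
    b_eq = (1.0 - alpha) * gamma

    res = linprog(
        c.reshape(-1),
        A_ub=d.reshape(len(xi), S * A),
        b_ub=xi,
        A_eq=A_eq,
        b_eq=b_eq,
        bounds=(0, None),            # occupancy measure is nonnegative
        method="highs",
    )
    if not res.success:
        raise ValueError(f"LP not solved: {res.message}")

    rho = res.x.reshape(S, A)
    # Recover a stationary randomized policy; states with zero mass get a uniform rule.
    state_mass = rho.sum(axis=1, keepdims=True)
    policy = np.full((S, A), 1.0 / A)
    np.divide(rho, state_mass, out=policy, where=state_mass > 1e-12)
    return rho, policy, res.fun


# Toy instance with made-up numbers (purely illustrative).
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.5, 0.5], [0.1, 0.9]]])
c = np.array([[1.0, 2.0], [3.0, 0.5]])
d = np.array([[[0.0, 1.0], [0.0, 1.0]]])  # unit cost whenever action 1 is taken
xi = np.array([0.4])
gamma = np.array([0.5, 0.5])

rho, policy, value = solve_cmdp_lp(P, c, d, xi, gamma, alpha=0.9)
print(policy)
print(value)
```

With the $(1-\alpha)$ normalization assumed here, the occupancy measure sums to one and the returned objective is the normalized discounted cost. Once the costs or transition probabilities become random, this LP no longer applies directly, which is the setting the paper addresses with chance constraints.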
