Approximate Constrained Discounted Dynamic Programming with Uniform   Feasibility and Optimality

Hyeong Soo Chang

arXiv:2308.03297·math.OC·August 8, 2023·IEEE Trans. Autom. Control.

Approximate Constrained Discounted Dynamic Programming with Uniform Feasibility and Optimality

Hyeong Soo Chang

PDF

Open Access

TL;DR

This paper develops a dynamic programming approach for approximately solving infinite-horizon constrained Markov decision processes with uniform feasibility, providing recursive equations and algorithms that ensure optimality and convergence.

Contribution

It introduces a DP-equation for CMDPs that guarantees uniform feasibility and optimality, along with algorithms inspired by policy iteration for efficient computation.

Findings

01

Derived a recursive DP-equation for CMDPs.

02

Proposed algorithms with local convergence guarantees.

03

Ensured policies are uniformly optimal and feasible.

Abstract

We consider a dynamic programming (DP) approach to approximately solving an infinite-horizon constrained Markov decision process (CMDP) problem with a fixed initial-state for the expected total discounted-reward criterion with a uniform-feasibility constraint of the expected total discounted-cost in a deterministic, history-independent, and stationary policy set. We derive a DP-equation that recursively holds for a CMDP problem and its sub-CMDP problems, where each problem, induced from the parameters of the original CMDP problem, admits a uniformly-optimal feasible policy in its policy set associated with the inputs to the problem. A policy constructed from the DP-equation is shown to achieve the optimal values, defined for the CMDP problem the policy is a solution to, at all states. Based on the result, we discuss off-line and on-line computational algorithms, motivated from policy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSupply Chain and Inventory Management