Finite-State Approximations to Discounted and Average Cost Constrained   Markov Decision Processes

Naci Saldi

arXiv:1807.02994·math.OC·July 10, 2018·IEEE Trans. Autom. Control.

Finite-State Approximations to Discounted and Average Cost Constrained Markov Decision Processes

Naci Saldi

PDF

Open Access

TL;DR

This paper develops methods for approximating constrained Markov decision processes with finite states, proving convergence of the optimal values and providing explicit bounds on the approximation error for both discounted and average cost criteria.

Contribution

It introduces a unified approach for finite-state approximation of constrained MDPs, establishing convergence and error bounds under regularity conditions.

Findings

01

Optimal value convergence proven for finite-state models

02

Explicit error bounds for approximation quality

03

Method for computing approximately optimal policies

Abstract

In this paper, we consider the finite-state approximation of a discrete-time constrained Markov decision process (MDP) under the discounted and average cost criteria. Using the linear programming formulation of the constrained discounted cost problem, we prove the asymptotic convergence of the optimal value of the finite-state model to the optimal value of the original model. With further continuity condition on the transition probability, we also establish a method to compute approximately optimal policies. For the average cost, instead of using the finite-state linear programming approximation method, we use the original problem definition to establish the finite-state asymptotic approximation of the constrained problem and compute approximately optimal policies. Under Lipschitz type regularity conditions on the components of the MDP, we also obtain explicit rate of convergence bounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Reinforcement Learning in Robotics · Machine Learning and Algorithms