MDPs with Setwise Continuous Transition Probabilities

Eugene A. Feinberg; Pavlo O. Kasyanov

arXiv:2011.01325·math.OC·August 3, 2021·Oper. Res. Lett.

MDPs with Setwise Continuous Transition Probabilities

Eugene A. Feinberg, Pavlo O. Kasyanov

PDF

TL;DR

This paper investigates the structure of optimal policies in infinite-state Markov Decision Processes with setwise continuous transition probabilities, accommodating noncompact action sets and various cost criteria, using a new optimal selection theorem.

Contribution

Introduces a novel optimal selection theorem for inf-compact functions and applies it to analyze optimal policies in complex MDPs with setwise continuous transitions.

Findings

01

Characterizes optimal policies under setwise continuity.

02

Handles noncompact action sets in infinite-state MDPs.

03

Provides a unified approach for discounted, undiscounted, and average costs.

Abstract

This paper describes the structure of optimal policies for infinite-state Markov Decision Processes with setwise continuous transition probabilities. The action sets may be noncompact. The objective criteria are either the expected total discounted and undiscounted costs or average costs per unit time. The analysis of optimality equations and inequalities is based on the optimal selection theorem for inf-compact functions introduced in this paper.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.