Average-Cost MDPs with Infinite State and Action Sets: New Sufficient Conditions for Optimality Inequalities and Equations

Eugene A. Feinberg; Pavlo O. Kasyanov; Liliia S. Paliichuk

arXiv:2412.01594·math.OC·January 28, 2025

Open Access

TL;DR

This paper establishes new sufficient conditions for the validity of optimality inequalities and equations in infinite-horizon average-cost MDPs with infinite state and action spaces, ensuring the existence of deterministic optimal policies.

Contribution

It introduces new sufficient conditions under which optimality inequalities and optimality equations hold for average-cost MDPs with weakly and setwise continuous transition probabilities.

Findings

01 New sufficient conditions for optimality inequalities

02 Conditions ensuring the validity of optimality equations

03 Existence of deterministic optimal policies under these conditions

Abstract

This paper studies discrete-time average-cost infinite-horizon Markov decision processes (MDPs) with Borel state and action sets. It introduces new sufficient conditions for the validity of optimality inequalities and optimality equations for MDPs with weakly and setwise continuous transition probabilities. These inequalities and equations imply the existence of deterministic optimal policies.
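
For orientation, the optimality inequality and optimality equation mentioned in the abstract are usually written as below in the average-cost MDP literature. This is a sketch in assumed notation, none of which appears in the abstract itself: X is the state space, A(x) the set of actions available at state x, c(x,a) the one-step cost, q(dy | x,a) the transition probability, w a constant, and u a function on X. The exact hypotheses on u and w, and whether the minimum is attained, are what the paper's conditions address.

```latex
% Average-cost optimality inequality (ACOI), in assumed notation:
w + u(x) \;\ge\; \min_{a \in A(x)} \Big[ c(x,a) + \int_{X} u(y)\, q(dy \mid x,a) \Big], \qquad x \in X.

% Average-cost optimality equation (ACOE): the same relation with equality.
w + u(x) \;=\; \min_{a \in A(x)} \Big[ c(x,a) + \int_{X} u(y)\, q(dy \mid x,a) \Big], \qquad x \in X.
```

In this standard setup, a deterministic policy that picks at each state x an action attaining the minimum is the natural candidate for an average-cost optimal policy, which is how such inequalities and equations lead to the existence result stated above.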

Taxonomy

Topics: Process Optimization and Integration