Strategy Complexity of Limsup and Liminf Threshold Objectives in   Countable MDPs, with Applications to Optimal Expected Payoffs

Richard Mayr; Eric Munday

arXiv:2211.13259·math.OC·September 19, 2024

Strategy Complexity of Limsup and Liminf Threshold Objectives in Countable MDPs, with Applications to Optimal Expected Payoffs

Richard Mayr, Eric Munday

PDF

Open Access

TL;DR

This paper analyzes the strategy complexity in countable Markov decision processes for $ ext{limsup}$ and $ ext{liminf}$ threshold objectives, providing bounds on memory requirements and solving open problems related to optimal expected payoffs.

Contribution

It establishes the complete strategy complexity bounds for $ ext{limsup}$ and $ ext{liminf}$ objectives in countable MDPs and applies these results to open problems on optimal strategies for expected payoffs.

Findings

01

Bounds on memory requirements for $ ext{limsup}$ and $ ext{liminf}$ objectives.

02

Complete characterization of strategy complexity in countable MDPs.

03

Resolution of open problems on optimal strategies for expected $ ext{limsup}$ and $ ext{liminf}$ payoffs.

Abstract

We study Markov decision processes (MDPs) with a countably infinite number of states. The $lim sup$ (resp. $lim inf$ ) threshold objective is to maximize the probability that the $lim sup$ (resp. $lim inf$ ) of the infinite sequence of directly seen rewards is non-negative. We establish the complete picture of the strategy complexity of these objectives, i.e., the upper and lower bounds on the memory required by $ε$ -optimal (resp. optimal) strategies. We then apply these results to solve two open problems from (Sudderth, Decisions in Economics and Finance, 2020) about the strategy complexity of optimal strategies for the expected $lim sup$ (resp. $lim inf$ ) payoff.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Distributed systems and fault tolerance · Game Theory and Applications