Strategy Complexity of Limsup and Liminf Threshold Objectives in Countable MDPs, with Applications to Optimal Expected Payoffs
Richard Mayr, Eric Munday

TL;DR
This paper analyzes the strategy complexity in countable Markov decision processes for $ ext{limsup}$ and $ ext{liminf}$ threshold objectives, providing bounds on memory requirements and solving open problems related to optimal expected payoffs.
Contribution
It establishes the complete strategy complexity bounds for $ ext{limsup}$ and $ ext{liminf}$ objectives in countable MDPs and applies these results to open problems on optimal strategies for expected payoffs.
Findings
Bounds on memory requirements for $ ext{limsup}$ and $ ext{liminf}$ objectives.
Complete characterization of strategy complexity in countable MDPs.
Resolution of open problems on optimal strategies for expected $ ext{limsup}$ and $ ext{liminf}$ payoffs.
Abstract
We study Markov decision processes (MDPs) with a countably infinite number of states. The (resp. ) threshold objective is to maximize the probability that the (resp. ) of the infinite sequence of directly seen rewards is non-negative. We establish the complete picture of the strategy complexity of these objectives, i.e., the upper and lower bounds on the memory required by -optimal (resp. optimal) strategies. We then apply these results to solve two open problems from (Sudderth, Decisions in Economics and Finance, 2020) about the strategy complexity of optimal strategies for the expected (resp. ) payoff.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed systems and fault tolerance · Game Theory and Applications
