Sure-almost-sure and Sure-limit-sure Window Mean Payoff in Markov Decision Processes

Pranshu Gaba; Shibashis Guha

arXiv:2605.12191·cs.GT·May 13, 2026

Sure-almost-sure and Sure-limit-sure Window Mean Payoff in Markov Decision Processes

Pranshu Gaba, Shibashis Guha

PDF

TL;DR

This paper addresses the computational complexity and strategy construction for sure-almost-sure and sure-limit-sure window mean-payoff objectives in Markov decision processes, providing complexity classifications and memory bounds.

Contribution

It solves the sure-almost-sure and sure-limit-sure problems for window mean-payoff objectives, establishing complexity results and strategy memory bounds.

Findings

01

Both problems are in P for fixed window length (if given in unary).

02

Both problems are in NP ∩ coNP for the bounded window length variant.

03

The paper provides bounds on the memory required for winning strategies.

Abstract

Given rationals $α$ and $β$ , the sure-almost-sure problem for a quantitative objective $φ$ in a Markov decision process (MDP) asks if one can simultaneously ensure that all outcomes of the MDP have $φ$ -value at least $α$ (i.e. sure $α$ satisfaction) and with probability $1$ the outcome has $φ$ -value at least $β$ (i.e. almost-sure $β$ satisfaction). The sure-limit-sure problem asks if for all $ε > 0$ one can simultaneously ensure that all outcomes have $φ$ -value at least $α$ and with probability at least $1 - ε$ the outcome has $φ$ -value at least $β$ . Moreover, if simultaneous satisfaction of objectives is possible, then one would also like to construct a strategy (for sure-almost-sure) or a family of strategies (for sure-limit-sure) that achieves this. In this paper, we solve the sure-almost-sure and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.