Expected Window Mean-Payoff
Benjamin Bordais, Shibashis Guha, Jean-Fran\c{c}ois Raskin

TL;DR
This paper introduces methods to compute the expected supremum of window mean-payoff values in Markov chains and decision processes, extending prior work on ensuring minimum payoffs in two-player games.
Contribution
It defines and analyzes the expected value of the window mean-payoff function in stochastic models, including fixed and bounded window variants, with direct and prefix-independent versions.
Findings
Provides algorithms for expected window mean-payoff in Markov chains and MDPs.
Extends previous work on deterministic strategies to stochastic settings.
Analyzes both fixed and variable window length scenarios.
Abstract
In the window mean-payoff objective, given an infinite path, instead of considering a long run average, we consider the minimum payoff that can be ensured at every position of the path over a finite window that slides over the entire path. Chatterjee et al. studied the problem to decide if in a two-player game, Player 1 has a strategy to ensure a window mean-payoff of at least 0. In this work, we consider a function that given a path returns the supremum value of the window mean-payoff that can be ensured over the path and we show how to compute its expected value in Markov chains and Markov decision processes. We consider two variants of the function: Fixed window mean-payoff in which a fixed window length is provided; and Bounded window mean-payoff in which we compute the maximum possible value of the window mean-payoff over all possible window lengths. Further, for both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
