A Pseudo-Polynomial Algorithm for Mean Payoff Stochastic Games with Perfect Information and Few Random Positions
Endre Boros, Khaled Elbassioni, Vladimir Gurvich, Kazuhisa Makino

TL;DR
This paper presents a pseudo-polynomial algorithm for solving BWR-games with perfect information and a fixed number of random positions, advancing the understanding of their computational complexity.
Contribution
It introduces a pseudo-polynomial time algorithm for BWR-games with a constant number of random positions, a significant step towards polynomial solvability.
Findings
Pseudo-polynomial algorithm for BWR-games with few random positions
Polynomial time solution for fixed number of random positions
Implication for the open problem of polynomial solvability of BWR-games
Abstract
We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph , with local rewards , and three types of positions: black , white , and random forming a partition of . It is a long-standing open question whether a polynomial time algorithm for BWR-games exists, or not, even when . In fact, a pseudo-polynomial algorithm for BWR-games would already imply their polynomial solvability. In this paper, we show that BWR-games with a constant number of random positions can be solved in pseudo-polynomial time. More precisely, in any BWR-game with , a saddle point in uniformly optimal pure stationary strategies can be found in time polynomial in , the maximum absolute local reward, and the common denominator of the transition probabilities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Game Theory and Voting Systems · Artificial Intelligence in Games
