A Convex Programming-based Algorithm for Mean Payoff Stochastic Games with Perfect Information
Endre Boros, Khaled Elbassioni, Vladimir Gurvich, Kazuhisa Makino

TL;DR
This paper presents a convex programming approach to solve two-person zero-sum stochastic mean payoff games with perfect information, achieving pseudo-polynomial time complexity when the number of random positions is fixed.
Contribution
It introduces a convex programming-based algorithm that solves BWR-games in pseudo-polynomial time for a fixed number of random positions, addressing a long-standing open problem.
Findings
Solves BWR-games using convex programming.
Achieves pseudo-polynomial time complexity for fixed random positions.
Provides a new approach for an open problem in stochastic game theory.
Abstract
We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph , with local rewards , and three types of positions: black , white , and random forming a partition of . It is a long-standing open question whether a polynomial time algorithm for BWR-games exists, even when . In fact, a pseudo-polynomial algorithm for BWR-games would already imply their polynomial solvability. In this short note, we show that BWR-games can be solved via convex programming in pseudo-polynomial time if the number of random positions is a constant.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Game Theory and Voting Systems · Auction Theory and Applications
