A Convex Programming-based Algorithm for Mean Payoff Stochastic Games with Perfect Information

Endre Boros; Khaled Elbassioni; Vladimir Gurvich; Kazuhisa Makino

arXiv:1610.06681·cs.DS·October 24, 2016

TL;DR

This paper presents a convex programming approach to solve two-person zero-sum stochastic mean payoff games with perfect information, achieving pseudo-polynomial time complexity when the number of random positions is fixed.

Contribution

The paper introduces a convex programming-based algorithm that solves BWR-games in pseudo-polynomial time when the number of random positions is fixed. Whether BWR-games admit a polynomial-time algorithm remains a long-standing open question.

Findings

01

Solves BWR-games using convex programming.

02

Achieves pseudo-polynomial time complexity when the number of random positions is fixed.

03

Offers a new approach to the long-standing open question of whether BWR-games can be solved in polynomial time.

Abstract

We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph $G = (V, E)$, with local rewards $r : E \to \mathbb{Z}$, and three types of positions: black $V_B$, white $V_W$, and random $V_R$ forming a partition of $V$. It is a long-standing open question whether a polynomial time algorithm for BWR-games exists, even when $|V_R| = 0$. In fact, a pseudo-polynomial algorithm for BWR-games would already imply their polynomial solvability. In this short note, we show that BWR-games can be solved via convex programming in pseudo-polynomial time if the number of random positions is a constant.
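To make the "mean payoff" objective concrete, here is a minimal Python sketch of the simplest setting mentioned in the abstract, $|V_R| = 0$. It is not the paper's convex programming algorithm. It assumes both players have already fixed positional strategies, so every position has exactly one chosen successor; the play then follows a path into a cycle, and the mean payoff is the average reward on that cycle. The function name `mean_payoff`, the dictionaries `succ` and `reward`, and the toy positions are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def mean_payoff(succ, reward, start):
    """Mean payoff of the play from `start` when every position has one
    fixed successor (assumption: no random positions, strategies fixed).

    succ   -- dict mapping each position to its chosen successor
    reward -- dict mapping each edge (v, w) to its integer local reward
    """
    first_visit = {}   # position -> index of the step at which it was first left
    rewards = []       # rewards collected along the play so far
    v = start
    while v not in first_visit:
        first_visit[v] = len(rewards)
        w = succ[v]
        rewards.append(reward[(v, w)])
        v = w
    # The play is now back at a visited position: everything collected
    # since that first visit is the cycle repeated forever.
    cycle = rewards[first_visit[v]:]
    return Fraction(sum(cycle), len(cycle))


# Toy instance (hypothetical): a -> b -> c -> b -> c -> ...
succ = {"a": "b", "b": "c", "c": "b"}
reward = {("a", "b"): 5, ("b", "c"): 1, ("c", "b"): -4}
value_from_a = mean_payoff(succ, reward, "a")
```

The reward 5 on the initial edge does not affect the result, because only the cycle between `b` and `c` contributes to the long-run average. With random positions present, the fixed-strategy play becomes a Markov chain rather than a single path, which this sketch does not handle.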

Taxonomy

Topics: Game Theory and Applications · Game Theory and Voting Systems · Auction Theory and Applications