Universal Complexity Bounds Based on Value Iteration for Stochastic Mean Payoff Games and Entropy Games
Xavier Allamigeon, St\'ephane Gaubert, Ricardo D. Katz, Mateusz Skomra

TL;DR
This paper introduces value iteration algorithms for solving various stochastic mean-payoff and entropy games, providing complexity bounds and polynomial-time solutions for fixed-rank cases.
Contribution
It develops a unified value iteration framework with complexity bounds and applies it to improve solutions for mean-payoff and entropy games with fixed parameters.
Findings
Number of oracle calls is polynomial in dimension and inverse separation.
Turn-based mean-payoff games with fixed random positions are solvable in pseudo-polynomial time.
Entropy games with fixed rank can be solved in polynomial time, extended to weighted cases.
Abstract
We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate, involving the norm of approximate sub and super-eigenvectors of the dynamic programming operator. We illustrate this method by two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Economic theories and models · Artificial Intelligence in Games
