Universal Complexity Bounds Based on Value Iteration for Stochastic Mean   Payoff Games and Entropy Games

Xavier Allamigeon; St\'ephane Gaubert; Ricardo D. Katz; Mateusz Skomra

arXiv:2206.09044·cs.GT·November 12, 2024·1 cites

Universal Complexity Bounds Based on Value Iteration for Stochastic Mean Payoff Games and Entropy Games

Xavier Allamigeon, St\'ephane Gaubert, Ricardo D. Katz, Mateusz Skomra

PDF

Open Access

TL;DR

This paper introduces value iteration algorithms for solving various stochastic mean-payoff and entropy games, providing complexity bounds and polynomial-time solutions for fixed-rank cases.

Contribution

It develops a unified value iteration framework with complexity bounds and applies it to improve solutions for mean-payoff and entropy games with fixed parameters.

Findings

01

Number of oracle calls is polynomial in dimension and inverse separation.

02

Turn-based mean-payoff games with fixed random positions are solvable in pseudo-polynomial time.

03

Entropy games with fixed rank can be solved in polynomial time, extended to weighted cases.

Abstract

We develop value iteration-based algorithms to solve in a unified manner different classes of combinatorial zero-sum games with mean-payoff type rewards. These algorithms rely on an oracle, evaluating the dynamic programming operator up to a given precision. We show that the number of calls to the oracle needed to determine exact optimal (positional) strategies is, up to a factor polynomial in the dimension, of order R/sep, where the "separation" sep is defined as the minimal difference between distinct values arising from strategies, and R is a metric estimate, involving the norm of approximate sub and super-eigenvectors of the dynamic programming operator. We illustrate this method by two applications. The first one is a new proof, leading to improved complexity estimates, of a theorem of Boros, Elbassioni, Gurvich and Makino, showing that turn-based mean-payoff games with a fixed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Economic theories and models · Artificial Intelligence in Games