Solving irreducible stochastic mean-payoff games and entropy games by   relative Krasnoselskii-Mann iteration

Marianne Akian; St\'ephane Gaubert; Ulysse Naepels; Basile; Terver

arXiv:2305.02458·math.OC·May 5, 2023·1 cites

Solving irreducible stochastic mean-payoff games and entropy games by relative Krasnoselskii-Mann iteration

Marianne Akian, St\'ephane Gaubert, Ulysse Naepels, Basile, Terver

PDF

Open Access

TL;DR

This paper introduces an algorithm combining relative value iteration and Krasnoselskii-Mann damping to efficiently approximate values in irreducible stochastic mean-payoff and entropy games, with explicit complexity bounds.

Contribution

It provides the first parameterized complexity bounds for irreducible stochastic mean-payoff and entropy games using a novel iteration method.

Findings

01

Approximate game values in $O(| ext{log} \\epsilon|)$ iterations for irreducible stochastic games.

02

Derived explicit constants depending on minimal non-zero transition probabilities.

03

Established contraction properties of the iteration method via variational analysis.

Abstract

We analyse an algorithm solving stochastic mean-payoff games, combining the ideas of relative value iteration and of Krasnoselskii-Mann damping. We derive parameterized complexity bounds for several classes of games satisfying irreducibility conditions. We show in particular that an $ϵ$ -approximation of the value of an irreducible concurrent stochastic game can be computed in a number of iterations in $O (∣ lo g ϵ ∣)$ where the constant in the $O (\cdot)$ is explicit, depending on the smallest non-zero transition probabilities. This should be compared with a bound in $O (∣ ϵ ∣^{- 1} ∣ lo g (ϵ) ∣)$ obtained by Chatterjee and Ibsen-Jensen (ICALP 2014) for the same class of games, and to a $O (∣ ϵ ∣^{- 1})$ bound by Allamigeon, Gaubert, Katz and Skomra (ICALP 2022) for turn-based games. We also establish parameterized complexity bounds for entropy games, a class of matrix…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Computability, Logic, AI Algorithms · Model Reduction and Neural Networks