Generic uniqueness of the bias vector of finite stochastic games with perfect information

Marianne Akian; Stéphane Gaubert; Antoine Hochart

arXiv:1610.09651·math.OC·December 30, 2019

TL;DR

This paper proves that in finite perfect-information stochastic games, the bias vector is generically unique up to an additive constant, using max-plus algebra and nonlinear Perron-Frobenius theory, with applications to solving degenerate cases.

Contribution

It establishes the generic uniqueness of the bias vector in finite perfect-information stochastic games and introduces a perturbation scheme for degenerate instances.

Findings

01

Bias vector is generically unique up to an additive constant.

02

The proof relies on techniques from max-plus algebra and nonlinear Perron-Frobenius theory.

03

Perturbation scheme helps solve degenerate stochastic games.

Abstract

Mean-payoff zero-sum stochastic games can be studied by means of a nonlinear spectral problem. When the state space is finite, the latter consists in finding an eigenpair $(u, \lambda)$ solution of $T(u) = \lambda e + u$, where $T : \mathbb{R}^{n} \to \mathbb{R}^{n}$ is the Shapley (or dynamic programming) operator, $\lambda$ is a scalar, $e$ is the unit vector, and $u \in \mathbb{R}^{n}$. The scalar $\lambda$ yields the mean payoff per time unit, and the vector $u$, called the bias, allows one to determine optimal stationary strategies. The existence of the eigenpair $(u, \lambda)$ is generally related to ergodicity conditions. A basic issue is to understand for which classes of games the bias vector is unique (up to an additive constant). In this paper, we consider perfect-information zero-sum stochastic games with finite state and action spaces, thinking of the transition payments as variable…
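To make the eigenproblem $T(u) = \lambda e + u$ concrete, here is a minimal Python sketch for a toy two-state perfect-information game. Everything in it is an illustrative assumption rather than material from the paper: the names `shapley_operator` and `solve_eigenpair`, the payoff and transition data, and the solution method, which is a damped fixed-point iteration (in the Krasnoselskii-Mann style) swapped in for illustration. It assumes an eigenpair exists, and it fixes the additive constant by normalizing `u[0] = 0`.

```python
import numpy as np


def shapley_operator(u, payoffs, transitions, is_max):
    """Apply T to u. State i is controlled by one player, who takes the
    max (or min) over its actions of payoff + expected next-state value."""
    out = np.empty(len(u))
    for i in range(len(u)):
        values = payoffs[i] + transitions[i] @ u  # one entry per action
        out[i] = values.max() if is_max[i] else values.min()
    return out


def solve_eigenpair(payoffs, transitions, is_max, tol=1e-10, max_iter=10_000):
    """Damped iteration u <- (u + T(u)) / 2, renormalized so that u[0] = 0.
    Hypothetical helper: assumes some (u, lam) with T(u) = lam * e + u exists."""
    u = np.zeros(len(payoffs))
    for _ in range(max_iter):
        d = shapley_operator(u, payoffs, transitions, is_max) - u
        # At an eigenvector, T(u) - u is the constant vector lam * e.
        if d.max() - d.min() < tol:
            return u, 0.5 * (d.max() + d.min())
        u = u + 0.5 * d
        u -= u[0]  # the bias is only defined up to an additive constant
    raise RuntimeError("no convergence within max_iter iterations")


# Toy data (assumed): state 0 belongs to the maximizer, state 1 to the minimizer.
# payoffs[i][a] is the payment for action a in state i;
# transitions[i][a] is the corresponding probability row over next states.
payoffs = [np.array([1.0, 0.0]), np.array([3.0, 2.0])]
transitions = [np.full((2, 2), 0.5), np.full((2, 2), 0.5)]
is_max = [True, False]

u, lam = solve_eigenpair(payoffs, transitions, is_max)
print("mean payoff:", lam)
print("bias (normalized so u[0] = 0):", u)
```

For this toy instance the eigenpair can be checked by hand: with $u = (0, 1)$ one gets $T(u) = (1.5, 2.5) = 1.5\,e + u$, so the sketch should return a mean payoff of 1.5 and a bias close to $(0, 1)$. Adding any constant to $u$ gives another solution, which is why uniqueness of the bias is only ever meant up to an additive constant.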
