Generic uniqueness of the bias vector of finite stochastic games with perfect information
Marianne Akian, St\'ephane Gaubert, Antoine Hochart

TL;DR
This paper proves that in finite perfect-information stochastic games, the bias vector is generically unique up to an additive constant, using max-plus algebra and nonlinear Perron-Frobenius theory, with applications to solving degenerate cases.
Contribution
It establishes the generic uniqueness of the bias vector in finite perfect-information stochastic games and introduces a perturbation scheme for degenerate instances.
Findings
Bias vector is generically unique up to an additive constant.
Techniques from max-plus algebra and nonlinear Perron-Frobenius theory are used.
Perturbation scheme helps solve degenerate stochastic games.
Abstract
Mean-payoff zero-sum stochastic games can be studied by means of a nonlinear spectral problem. When the state space is finite, the latter consists in finding an eigenpair solution of , where is the Shapley (or dynamic programming) operator, is a scalar, is the unit vector, and . The scalar yields the mean payoff per time unit, and the vector , called the bias, allows one to determine optimal stationary strategies. The existence of the eigenpair is generally related to ergodicity conditions. A basic issue is to understand for which classes of games the bias vector is unique (up to an additive constant). In this paper, we consider perfect-information zero-sum stochastic games with finite state and action spaces, thinking of the transition payments as variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
