TL;DR
This paper investigates the computational complexity of finding stationary Nash equilibria in infinite-horizon general-sum stochastic games, revealing PPAD-hardness even in simplified turn-based cases, but identifying polynomial-time solutions under certain conditions.
Contribution
It proves PPAD-hardness for computing stationary NE in general-sum stochastic games and identifies polynomial-time algorithms for specific structured turn-based cases.
Findings
Computing NE is PPAD-hard in general stochastic games.
Turn-based stochastic games are also PPAD-hard.
Polynomial-time algorithms exist for certain structured turn-based games.
Abstract
We study the complexity of computing stationary Nash equilibrium (NE) in n-player infinite-horizon general-sum stochastic games. We focus on the problem of computing NE in such stochastic games when each player is restricted to choosing a stationary policy and rewards are discounted. First, we prove that computing such NE is in PPAD (in addition to clearly being PPAD-hard). Second, we consider turn-based specializations of such games where at each state there is at most a single player that can take actions and show that these (seemingly-simpler) games remain PPAD-hard. Third, we show that under further structural assumptions on the rewards computing NE in such turn-based games is possible in polynomial time. Towards achieving these results we establish structural facts about stochastic games of broader utility, including monotonicity of utilities under single-state single-action…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
The Complexity of Infinite-Horizon General-Sum Stochastic Games· youtube
