Bounded-Memory Strategies in Partial-Information Games
Sougata Bose, Rasmus Ibsen-Jensen, Patrick Totzke

TL;DR
This paper investigates the computational complexity of finding bounded-memory strategies in partial-information stochastic games, providing complexity bounds and algorithms for approximating equilibria and game values.
Contribution
It establishes NP-hardness results and offers polynomial space algorithms for deciding the existence of bounded-memory equilibria in multi-player games.
Findings
NP-hardness for approximating zero-sum values with memoryless strategies
Polynomial space decision procedure for bounded-memory $oldsymbol{ ext{$orall$}k}$-player games
Algorithms for finding $oldsymbol{ ext{$orall$}k}$-player and 2-player bounded-memory $oldsymbol{ ext{$orall$}b}$-strategy equilibria
Abstract
We study the computational complexity of solving stochastic games with mean-payoff objectives. Instead of identifying special classes in which simple strategies are sufficient to play -optimally, or form -Nash equilibria, we consider general partial-information multiplayer games and ask what can be achieved with (and against) finite-memory strategies up to a {given} bound on the memory. We show -hardness for approximating zero-sum values, already with respect to memoryless strategies and for 1-player reachability games. On the other hand, we provide upper bounds for solving games of any fixed number of players . We show that one can decide in polynomial space if, for a given -player game, and bound , there exists an -Nash equilibrium in which all strategies use at most memory modes. For given , finding an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Optimization and Search Problems · Computability, Logic, AI Algorithms
