Arena-Independent Finite-Memory Determinacy in Stochastic Games
Patricia Bouyer, Youssouf Oualhadj, Mickael Randour, Pierre Vandenhove

TL;DR
This paper investigates the complexity of strategies in stochastic zero-sum games, focusing on arena-independent finite-memory strategies, and extends deterministic game results to stochastic settings.
Contribution
It characterizes when arena-independent finite-memory strategies suffice in stochastic games and reduces their analysis to simpler one-player cases.
Findings
Pure AIFM strategies suffice for optimal play and subgame perfect strategies.
Reduction of two-player stochastic game analysis to one-player cases.
Characterization of objectives requiring AIFM strategies based on intuitive properties.
Abstract
We study stochastic zero-sum games on graphs, which are prevalent tools to model decision-making in presence of an antagonistic opponent in a random environment. In this setting, an important question is the one of strategy complexity: what kinds of strategies are sufficient or required to play optimally (e.g., randomization or memory requirements)? Our contributions further the understanding of arena-independent finite-memory (AIFM) determinacy, i.e., the study of objectives for which memory is needed, but in a way that only depends on limited parameters of the game graphs. First, we show that objectives for which pure AIFM strategies suffice to play optimally also admit pure AIFM subgame perfect strategies. Second, we show that we can reduce the study of objectives for which pure AIFM strategies suffice in two-player stochastic games to the easier study of one-player stochastic games…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Artificial Intelligence in Games · Reinforcement Learning in Robotics
