Uniform Value and Decidability in Ergodic Blind Stochastic Games
Krishnendu Chatterjee, David Lurie, Raimundo Saona, Bruno Ziliotto

TL;DR
This paper introduces ergodic blind stochastic games, proving the existence of a uniform value and its approximability, which is a novel result even for POMDPs, and shows the value's independence from initial beliefs.
Contribution
The paper establishes the existence and approximability of the uniform value in ergodic blind stochastic games, a significant advancement in understanding these games and POMDPs.
Findings
Proves the existence of the uniform value in ergodic blind stochastic games.
Provides an algorithm to approximate the uniform value, demonstrating decidability.
Shows the uniform value is independent of initial beliefs.
Abstract
We study a class of two-player zero-sum stochastic games known as \textit{blind stochastic games}, where players neither observe the state nor receive any information about it during the game. A central concept for analyzing long-duration stochastic games is the \textit{uniform value}. A game has a uniform value if for every , Player 1 (resp., Player 2) has a strategy such that, for all sufficiently large , his average payoff over stages is at least (resp., at most ). Prior work has shown that the uniform value may not exist in general blind stochastic games. To address this, we introduce a subclass called \textit{ergodic blind stochastic games}, defined by imposing an ergodicity condition on the state transitions. For this subclass, we prove the existence of the uniform value and provide an algorithm to approximate it,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFormal Methods in Verification · Reinforcement Learning in Robotics · Advanced Queuing Theory Analysis
