Concurrent Stochastic Games with Stateful-discounted and Parity Objectives: Complexity and Algorithms
Ali Asadi, Krishnendu Chatterjee, Raimundo Saona, Jakub Svoboda

TL;DR
This paper analyzes the complexity and algorithms for approximating the value in two-player zero-sum concurrent stochastic games with stateful-discounted and parity objectives, providing new complexity bounds and more efficient algorithms.
Contribution
It establishes TFNP[NP] complexity for the value approximation problem and introduces algorithms with improved action dependency, especially efficient for fixed number of states.
Findings
Complexity is TFNP[NP] for the value approximation.
Algorithms with logarithmic action dependence are developed.
Polynomial time algorithms are possible when the number of states is fixed.
Abstract
We study two-player zero-sum concurrent stochastic games with finite state and action space played for an infinite number of steps. In every step, the two players simultaneously and independently choose an action. Given the current state and the chosen actions, the next state is obtained according to a stochastic transition function. An objective is a measurable function on plays (or infinite trajectories) of the game, and the value for an objective is the maximal expectation that the player can guarantee against the adversarial player. We consider: (a) stateful-discounted objectives, which are similar to the classical discounted-sum objectives, but states are associated with different discount factors rather than a single discount factor; and (b) parity objectives, which are a canonical representation for -regular objectives. For stateful-discounted objectives, given an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
