Value-Set Iteration: Computing Optimal Correlated Equilibria in Infinite-Horizon Multi-Player Stochastic Games
Jiarui Gan, Rupak Majumdar

TL;DR
This paper introduces a polynomial-time algorithm for computing near-optimal correlated equilibria in infinite-horizon multi-player stochastic games, addressing the challenge of history-dependent policies through a novel value-set iteration method.
Contribution
It presents a new polynomial-time algorithm for approximate correlated equilibria in complex stochastic games using inducible value sets and fixed point characterization.
Findings
Algorithm computes $(psilon,elta)$-optimal CEs efficiently
Complexity is polynomial in key parameters for general games
Turn-based games have reduced polynomial complexity
Abstract
We study the problem of computing optimal correlated equilibria (CEs) in infinite-horizon multi-player stochastic games, where correlation signals are provided over time. In this setting, optimal CEs require history-dependent policies; this poses new representational and algorithmic challenges as the number of possible histories grows exponentially with the number of time steps. We focus on computing -optimal CEs -- solutions that achieve a value within of an optimal CE, while allowing the agents' incentive constraints to be violated by at most . Our main result is an algorithm that computes an -optimal CE in time polynomial in , where is the discount factor, and is the number of agents. For (a slightly more general variant of) turn-based games, we further reduce the complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Reinforcement Learning in Robotics · Auction Theory and Applications
