Simple Uncoupled No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium
Gabriele Farina, Andrea Celli, Alberto Marchesi, Nicola Gatti

TL;DR
This paper introduces the first uncoupled no-regret learning dynamics that converge to extensive-form correlated equilibria in general-sum extensive-form games, addressing a longstanding open problem.
Contribution
It provides a novel, polynomial-time computable, uncoupled learning algorithm that guarantees convergence to EFCEs in extensive-form games with perfect recall.
Findings
Convergence to EFCE with high probability after T repetitions
Empirical frequency approaches an EFCE almost surely
Algorithm runs in polynomial time relative to game size
Abstract
The existence of simple uncoupled no-regret learning dynamics that converge to correlated equilibria in normal-form games is a celebrated result in the theory of multi-agent systems. Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium. Extensive-form games generalize normal-form games by modeling both sequential and simultaneous moves, as well as imperfect information. Because of the sequential nature and presence of private information in the game, correlation in extensive-form games possesses significantly different properties than its counterpart in normal-form games, many of which are still open research directions. Extensive-form correlated equilibrium (EFCE) has been proposed as the natural extensive-form…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Bandit Algorithms Research
