Efficient Decentralized Learning Dynamics for Extensive-Form Coarse Correlated Equilibrium: No Expensive Computation of Stationary Distributions Required
Gabriele Farina, Andrea Celli, Tuomas Sandholm

TL;DR
This paper introduces a new decentralized learning method for extensive-form coarse correlated equilibrium (EFCCE) that avoids expensive computations, guarantees convergence, and is simpler than existing methods for related solution concepts.
Contribution
It proposes a novel, efficient learning dynamics for EFCCE that does not require stationary distribution calculations, bridging the gap between EFCE and NFCCE in terms of computational complexity.
Findings
Guarantees $O(1/\sqrt{T})$-approximate EFCCE after T iterations
Almost sure convergence to EFCCE in the limit
Reduces computational complexity compared to EFCE dynamics
Abstract
While in two-player zero-sum games the Nash equilibrium is a well-established prescriptive notion of optimal play, its applicability as a prescriptive tool beyond that setting is limited. Consequently, the study of decentralized learning dynamics that guarantee convergence to correlated solution concepts in multiplayer, general-sum extensive-form (i.e., tree-form) games has become an important topic of active research. The per-iteration complexity of the currently known learning dynamics depends on the specific correlated solution concept considered. For example, in the case of extensive-form correlated equilibrium (EFCE), all known dynamics require, as an intermediate step at each iteration, to compute the stationary distribution of multiple Markov chains, an expensive operation in practice. Oppositely, in the case of normal-form coarse correlated equilibrium (NFCCE), simple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Reinforcement Learning in Robotics
