Efficient Decentralized Learning Dynamics for Extensive-Form Coarse Correlated Equilibrium: No Expensive Computation of Stationary Distributions Required

Gabriele Farina; Andrea Celli; Tuomas Sandholm

arXiv:2109.08138 · cs.GT · September 17, 2021

Open Access

TL;DR

This paper introduces decentralized learning dynamics for extensive-form coarse correlated equilibrium (EFCCE) that avoid the per-iteration stationary-distribution computations required by known EFCE dynamics, guarantee convergence to EFCCE, and are simpler than existing methods for related solution concepts.

Contribution

It proposes novel, efficient learning dynamics for EFCCE that do not require computing stationary distributions of Markov chains, placing EFCCE between EFCE and NFCCE in terms of per-iteration computational complexity.

Findings

01  Guarantees an $O(1/\sqrt{T})$-approximate EFCCE after $T$ iterations

02  Almost sure convergence to EFCCE in the limit

03  Reduces per-iteration computational complexity compared to EFCE dynamics

Abstract

While in two-player zero-sum games the Nash equilibrium is a well-established prescriptive notion of optimal play, its applicability as a prescriptive tool beyond that setting is limited. Consequently, the study of decentralized learning dynamics that guarantee convergence to correlated solution concepts in multiplayer, general-sum extensive-form (i.e., tree-form) games has become an important topic of active research. The per-iteration complexity of the currently known learning dynamics depends on the specific correlated solution concept considered. For example, in the case of extensive-form correlated equilibrium (EFCE), all known dynamics require, as an intermediate step at each iteration, to compute the stationary distribution of multiple Markov chains, an expensive operation in practice. Oppositely, in the case of normal-form coarse correlated equilibrium (NFCCE), simple…
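To make the contrast with NFCCE concrete, the following is a minimal sketch of a simple no-external-regret dynamic (regret matching) in a two-player normal-form game. It is not the paper's EFCCE algorithm; it only illustrates the kind of update that needs no stationary-distribution computation and whose average regret decays at an $O(1/\sqrt{T})$ rate. The function names, the example payoff matrices, and the assumption that payoffs lie in [0, 1] are all hypothetical choices made for illustration.

```python
import math


def regret_matching_strategy(cum_regret):
    """Play proportionally to positive cumulative regret (uniform if none)."""
    positive = [max(r, 0.0) for r in cum_regret]
    total = sum(positive)
    n = len(cum_regret)
    if total <= 0.0:
        return [1.0 / n] * n
    return [p / total for p in positive]


def cce_gap_after(payoff_1, payoff_2, T):
    """Run regret matching for both players for T rounds.

    payoff_1[i][j] and payoff_2[i][j] are the players' payoffs (assumed to
    lie in [0, 1]) when player 1 plays i and player 2 plays j.
    Returns the larger of the two players' average external regrets, which
    bounds how far the empirical play is from a coarse correlated equilibrium.
    """
    n1, n2 = len(payoff_1), len(payoff_1[0])
    regret_1 = [0.0] * n1
    regret_2 = [0.0] * n2
    for _ in range(T):
        x = regret_matching_strategy(regret_1)
        y = regret_matching_strategy(regret_2)
        # Expected utility of each pure action against the opponent's mix.
        u1 = [sum(payoff_1[i][j] * y[j] for j in range(n2)) for i in range(n1)]
        u2 = [sum(payoff_2[i][j] * x[i] for i in range(n1)) for j in range(n2)]
        # Expected utility of the mixed strategy actually played.
        v1 = sum(x[i] * u1[i] for i in range(n1))
        v2 = sum(y[j] * u2[j] for j in range(n2))
        # Accumulate instantaneous regret of each action.
        regret_1 = [regret_1[i] + u1[i] - v1 for i in range(n1)]
        regret_2 = [regret_2[j] + u2[j] - v2 for j in range(n2)]
    return max(max(regret_1), max(regret_2), 0.0) / T


# Hypothetical 3x3 general-sum game with payoffs in [0, 1].
PAYOFF_1 = [[0.0, 1.0, 0.5], [0.5, 0.0, 1.0], [1.0, 0.5, 0.0]]
PAYOFF_2 = [[0.5, 0.0, 1.0], [1.0, 0.5, 0.0], [0.0, 1.0, 0.5]]

if __name__ == "__main__":
    for T in (100, 1000, 10000):
        gap = cce_gap_after(PAYOFF_1, PAYOFF_2, T)
        print(T, gap, "bound:", math.sqrt(3 / T))
```

Each round costs only a pair of matrix-vector products and a normalization, which is the sense in which such dynamics are "simple". The standard regret-matching analysis bounds the average regret by $\sqrt{n/T}$ for $n$ actions and payoffs in [0, 1], which is the quantity printed as `bound`.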

Taxonomy

Topics: Advanced Bandit Algorithms Research · Game Theory and Applications · Reinforcement Learning in Robotics