Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse   Correlated Equilibria in Bimatrix Games

Ioannis Anagnostides; Gabriele Farina; Ioannis Panageas; Tuomas; Sandholm

arXiv:2203.12074·cs.GT·October 10, 2022·1 cites

Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games

Ioannis Anagnostides, Gabriele Farina, Ioannis Panageas, Tuomas, Sandholm

PDF

Open Access 1 Video

TL;DR

This paper demonstrates that optimistic mirror descent in bimatrix games either converges to an approximate Nash equilibrium or to a strong coarse correlated equilibrium, often faster than convergence to Nash in general-sum games.

Contribution

It provides theoretical guarantees for convergence of optimistic mirror descent to either approximate Nash or strong CCE in bimatrix games, highlighting faster convergence to CCE.

Findings

01

Optimistic mirror descent converges to approximate NE or strong CCE.

02

Convergence to CCE can occur after only a few iterations if NE is not approached.

03

Regret decays linearly when NE is not reached, indicating cycling behavior is efficient.

Abstract

We show that, for any sufficiently small fixed $ϵ > 0$ , when both players in a general-sum two-player (bimatrix) game employ optimistic mirror descent (OMD) with smooth regularization, learning rate $η = O (ϵ^{2})$ and $T = Ω (poly (1/ ϵ))$ repetitions, either the dynamics reach an $ϵ$ -approximate Nash equilibrium (NE), or the average correlated distribution of play is an $Ω (poly (ϵ))$ -strong coarse correlated equilibrium (CCE): any possible unilateral deviation does not only leave the player worse, but will decrease its utility by $Ω (poly (ϵ))$ . As an immediate consequence, when the iterates of OMD are bounded away from being Nash equilibria in a bimatrix game, we guarantee convergence to an exact CCE after only $O (1)$ iterations. Our results reveal that uncoupled no-regret learning algorithms can converge to CCE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Quantum many-body systems · Game Theory and Applications