Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games
Ioannis Anagnostides, Gabriele Farina, Ioannis Panageas, Tuomas, Sandholm

TL;DR
This paper demonstrates that optimistic mirror descent in bimatrix games either converges to an approximate Nash equilibrium or to a strong coarse correlated equilibrium, often faster than convergence to Nash in general-sum games.
Contribution
It provides theoretical guarantees for convergence of optimistic mirror descent to either approximate Nash or strong CCE in bimatrix games, highlighting faster convergence to CCE.
Findings
Optimistic mirror descent converges to approximate NE or strong CCE.
Convergence to CCE can occur after only a few iterations if NE is not approached.
Regret decays linearly when NE is not reached, indicating cycling behavior is efficient.
Abstract
We show that, for any sufficiently small fixed , when both players in a general-sum two-player (bimatrix) game employ optimistic mirror descent (OMD) with smooth regularization, learning rate and repetitions, either the dynamics reach an -approximate Nash equilibrium (NE), or the average correlated distribution of play is an -strong coarse correlated equilibrium (CCE): any possible unilateral deviation does not only leave the player worse, but will decrease its utility by . As an immediate consequence, when the iterates of OMD are bounded away from being Nash equilibria in a bimatrix game, we guarantee convergence to an exact CCE after only iterations. Our results reveal that uncoupled no-regret learning algorithms can converge to CCE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Quantum many-body systems · Game Theory and Applications
