On Last-Iterate Convergence Beyond Zero-Sum Games

Ioannis Anagnostides; Ioannis Panageas; Gabriele Farina; Tuomas; Sandholm

arXiv:2203.12056·cs.GT·March 24, 2022·6 cites

On Last-Iterate Convergence Beyond Zero-Sum Games

Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas, Sandholm

PDF

Open Access

TL;DR

This paper extends last-iterate convergence results beyond zero-sum games, introducing new techniques for broader game classes and learning dynamics, with implications for equilibrium approximation and efficiency.

Contribution

It provides novel regret-based analysis and convergence guarantees for optimistic mirror descent and gradient descent in diverse game settings beyond zero-sum scenarios.

Findings

01

OMD has bounded second-order path lengths in certain games.

02

OMD achieves $O(1/ oot{2}T)$ convergence rates and $O(1)$ regret bounds.

03

In potential games, convergence to an $ ext{epsilon}$-equilibrium occurs in $O(1/ ext{epsilon}^2)$ iterations.

Abstract

Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results and techniques that apply to broader families of games and learning dynamics. First, we use a regret-based analysis to show that in a class of games that includes constant-sum polymatrix and strategically zero-sum games, dynamics such as \emph{optimistic mirror descent (OMD)} have \emph{bounded second-order path lengths}, a property which holds even when players employ different algorithms and prediction mechanisms. This enables us to obtain $O (1/ T)$ rates and optimal $O (1)$ regret bounds. Our analysis also reveals a surprising property: OMD either reaches arbitrarily close to a Nash equilibrium, or it outperforms the \emph{robust price of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms