Optimistic Gradient Descent Ascent in Zero-Sum and General-Sum Bilinear Games
\'Etienne de Montbrun, J\'er\^ome Renault

TL;DR
This paper analyzes the convergence properties of Optimistic Gradient Descent Ascent (OGDA) in both zero-sum and general-sum bilinear games, providing new theoretical insights and practical algorithms for faster convergence and coordination.
Contribution
The paper extends convergence results of OGDA to zero-sum bilinear games with optimal geometric ratios and introduces OGDA for general-sum games, demonstrating improved convergence and coordination.
Findings
OGDA converges exponentially to saddle points in zero-sum games.
In some general-sum games, OGDA leads to exponential convergence to Nash or payoff divergence.
Using general-sum formulations can enhance min-max optimization algorithms.
Abstract
We study the convergence of Optimistic Gradient Descent Ascent in unconstrained bilinear games. In a first part, we consider the zero-sum case and extend previous results by Daskalakis et al. in 2018, Liang and Stokes in 2019, and others: we prove, for any payoff matrix, the exponential convergence of OGDA to a saddle point and also provide a new, optimal, geometric ratio for the convergence. We also characterize the step sizes inducing convergence, and are able to deduce the optimal step size for the speed of convergence. In a second part, we introduce OGDA for general-sum bilinear games: we show that in an interesting class of games, either OGDA converges exponentially fast to a Nash equilibrium, or the payoffs for both players converge exponentially fast to (which might be interpreted as endogenous emergence of coordination, or cooperation, among players). We also give…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Reinforcement Learning in Robotics · Markov Chains and Monte Carlo Methods
