On the Last-iterate Convergence in Time-varying Zero-sum Games: Extra Gradient Succeeds where Optimism Fails
Yi Feng, Hu Fu, Qun Hu, Ping Li, Ioannis Panageas, Bo Peng, Xiao Wang

TL;DR
This paper investigates the last-iterate convergence of optimization algorithms in time-varying zero-sum games, revealing that extra-gradient converges while optimistic gradient descent diverges in periodic settings, and all converge in stabilized perturbed games.
Contribution
It demonstrates that in time-varying zero-sum games, extra-gradient converges whereas OGDA diverges in periodic environments, and all algorithms converge in stabilized perturbed environments.
Findings
EG converges in periodic games, OGDA diverges
All algorithms converge in convergent perturbed games
First to show qualitative difference between EG and OGDA in time-varying settings
Abstract
Last-iterate convergence has received extensive study in two player zero-sum games starting from bilinear, convex-concave up to settings that satisfy the MVI condition. Typical methods that exhibit last-iterate convergence for the aforementioned games include extra-gradient (EG) and optimistic gradient descent ascent (OGDA). However, all the established last-iterate convergence results hold for the restrictive setting where the underlying repeated game does not change over time. Recently, a line of research has focused on regret analysis of OGDA in time-varying games, i.e., games where payoffs evolve with time; the last-iterate behavior of OGDA and EG in time-varying environments remains unclear though. In this paper, we study the last-iterate behavior of various algorithms in two types of unconstrained, time-varying, bilinear zero-sum games: periodic and convergent perturbed games.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods
