On the Last-iterate Convergence in Time-varying Zero-sum Games: Extra   Gradient Succeeds where Optimism Fails

Yi Feng; Hu Fu; Qun Hu; Ping Li; Ioannis Panageas; Bo Peng; Xiao Wang

arXiv:2310.02604·cs.GT·October 5, 2023

On the Last-iterate Convergence in Time-varying Zero-sum Games: Extra Gradient Succeeds where Optimism Fails

Yi Feng, Hu Fu, Qun Hu, Ping Li, Ioannis Panageas, Bo Peng, Xiao Wang

PDF

Open Access

TL;DR

This paper investigates the last-iterate convergence of optimization algorithms in time-varying zero-sum games, revealing that extra-gradient converges while optimistic gradient descent diverges in periodic settings, and all converge in stabilized perturbed games.

Contribution

It demonstrates that in time-varying zero-sum games, extra-gradient converges whereas OGDA diverges in periodic environments, and all algorithms converge in stabilized perturbed environments.

Findings

01

EG converges in periodic games, OGDA diverges

02

All algorithms converge in convergent perturbed games

03

First to show qualitative difference between EG and OGDA in time-varying settings

Abstract

Last-iterate convergence has received extensive study in two player zero-sum games starting from bilinear, convex-concave up to settings that satisfy the MVI condition. Typical methods that exhibit last-iterate convergence for the aforementioned games include extra-gradient (EG) and optimistic gradient descent ascent (OGDA). However, all the established last-iterate convergence results hold for the restrictive setting where the underlying repeated game does not change over time. Recently, a line of research has focused on regret analysis of OGDA in time-varying games, i.e., games where payoffs evolve with time; the last-iterate behavior of OGDA and EG in time-varying environments remains unclear though. In this paper, we study the last-iterate behavior of various algorithms in two types of unconstrained, time-varying, bilinear zero-sum games: periodic and convergent perturbed games.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods