Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization
Guodong Zhang, Yuanhao Wang, Laurent Lessard, Roger Grosse

TL;DR
This paper proves that alternating gradient descent-ascent (Alt-GDA) converges faster than simultaneous GDA in minimax problems, especially in strongly convex-strongly concave cases, and demonstrates its practical benefits in GAN training.
Contribution
It provides the first theoretical proof that Alt-GDA outperforms Sim-GDA in convergence speed for certain minimax problems and extends the analysis to global convergence for specific subclasses.
Findings
Alt-GDA achieves near-optimal local convergence in SCSC problems.
Alt-GDA converges faster than Sim-GDA by more than a constant.
Alternating updates significantly speed up GAN training.
Abstract
Smooth minimax games often proceed by simultaneous or alternating gradient updates. Although algorithms with alternating updates are commonly used in practice, the majority of existing theoretical analyses focus on simultaneous algorithms for convenience of analysis. In this paper, we study alternating gradient descent-ascent (Alt-GDA) in minimax games and show that Alt-GDA is superior to its simultaneous counterpart~(Sim-GDA) in many settings. We prove that Alt-GDA achieves a near-optimal local convergence rate for strongly convex-strongly concave (SCSC) problems while Sim-GDA converges at a much slower rate. To our knowledge, this is the \emph{first} result of any setting showing that Alt-GDA converges faster than Sim-GDA by more than a constant. We further adapt the theory of integral quadratic constraints (IQC) and show that Alt-GDA attains the same rate \emph{globally} for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research
