On Convergence of Gradient Descent Ascent: A Tight Local Analysis
Haochuan Li, Farzan Farnia, Subhro Das, Ali Jadbabaie

TL;DR
This paper provides a detailed local convergence analysis of Gradient Descent Ascent (GDA) for nonconvex-nonconcave minimax problems, revealing optimal stepsize ratios and convergence rates that align with practical observations.
Contribution
It establishes the necessary and sufficient stepsize ratio for local convergence of GDA to a Stackelberg Equilibrium in nonconvex-nonconcave settings, extending theoretical understanding.
Findings
A stepsize ratio of Θ(κ) is necessary and sufficient for local convergence.
The paper proves a nearly tight convergence rate with a matching lower bound.
Numerical experiments support the theoretical convergence guarantees.
Abstract
Gradient Descent Ascent (GDA) methods are the mainstream algorithms for minimax optimization in generative adversarial networks (GANs). Convergence properties of GDA have drawn significant interest in the recent literature. Specifically, for where is strongly-concave in and possibly nonconvex in , (Lin et al., 2020) proved the convergence of GDA with a stepsize ratio where and are the stepsizes for and and is the condition number for . While this stepsize ratio suggests a slow training of the min player, practical GAN algorithms typically adopt similar stepsizes for both variables, indicating a wide gap between theoretical and empirical results. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random lasers and scattering media
