On Convergence of Gradient Descent Ascent: A Tight Local Analysis

Haochuan Li; Farzan Farnia; Subhro Das; Ali Jadbabaie

arXiv:2207.00957·math.OC·July 5, 2022

On Convergence of Gradient Descent Ascent: A Tight Local Analysis

Haochuan Li, Farzan Farnia, Subhro Das, Ali Jadbabaie

PDF

Open Access

TL;DR

This paper provides a detailed local convergence analysis of Gradient Descent Ascent (GDA) for nonconvex-nonconcave minimax problems, revealing optimal stepsize ratios and convergence rates that align with practical observations.

Contribution

It establishes the necessary and sufficient stepsize ratio for local convergence of GDA to a Stackelberg Equilibrium in nonconvex-nonconcave settings, extending theoretical understanding.

Findings

01

A stepsize ratio of Θ(κ) is necessary and sufficient for local convergence.

02

The paper proves a nearly tight convergence rate with a matching lower bound.

03

Numerical experiments support the theoretical convergence guarantees.

Abstract

Gradient Descent Ascent (GDA) methods are the mainstream algorithms for minimax optimization in generative adversarial networks (GANs). Convergence properties of GDA have drawn significant interest in the recent literature. Specifically, for $min_{x} max_{y} f (x; y)$ where $f$ is strongly-concave in $y$ and possibly nonconvex in $x$ , (Lin et al., 2020) proved the convergence of GDA with a stepsize ratio $η_{y} / η_{x} = Θ (κ^{2})$ where $η_{x}$ and $η_{y}$ are the stepsizes for $x$ and $y$ and $κ$ is the condition number for $y$ . While this stepsize ratio suggests a slow training of the min player, practical GAN algorithms typically adopt similar stepsizes for both variables, indicating a wide gap between theoretical and empirical results. In this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random lasers and scattering media