On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

Tianyi Lin; Chi Jin; Michael I. Jordan

arXiv:1906.00331·cs.LG·May 6, 2024·118 cites

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

Tianyi Lin, Chi Jin, Michael I. Jordan

PDF

Open Access 1 Video

TL;DR

This paper analyzes the convergence of two-time-scale gradient descent ascent algorithms for nonconvex-concave minimax problems, providing the first nonasymptotic complexity results and explaining their practical success in training GANs.

Contribution

It offers the first nonasymptotic analysis of two-time-scale GDA for nonconvex-concave minimax problems, demonstrating its efficiency in finding stationary points.

Findings

01

GDA can find stationary points efficiently in nonconvex-concave problems.

02

Two-time-scale GDA outperforms single-step methods in convergence.

03

Results explain GDA's success in training GANs and similar applications.

Abstract

We consider nonconvex-concave minimax problems, $min_{x} max_{y \in Y} f (x, y)$ , where $f$ is nonconvex in $x$ but concave in $y$ and $Y$ is a convex and bounded set. One of the most popular algorithms for solving this problem is the celebrated gradient descent ascent (GDA) algorithm, which has been widely used in machine learning, control theory and economics. Despite the extensive convergence results for the convex-concave setting, GDA with equal stepsize can converge to limit cycles or even diverge in a general setting. In this paper, we present the complexity results on two-time-scale GDA for solving nonconvex-concave minimax problems, showing that the algorithm can find a stationary point of the function $Φ (\cdot) := max_{y \in Y} f (\cdot, y)$ efficiently. To the best our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research