On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems
Tianyi Lin, Chi Jin, Michael I. Jordan

TL;DR
This paper analyzes the convergence of two-time-scale gradient descent ascent algorithms for nonconvex-concave minimax problems, providing the first nonasymptotic complexity results and explaining their practical success in training GANs.
Contribution
It offers the first nonasymptotic analysis of two-time-scale GDA for nonconvex-concave minimax problems, demonstrating its efficiency in finding stationary points.
Findings
GDA can find stationary points efficiently in nonconvex-concave problems.
Two-time-scale GDA outperforms single-step methods in convergence.
Results explain GDA's success in training GANs and similar applications.
Abstract
We consider nonconvex-concave minimax problems, , where is nonconvex in but concave in and is a convex and bounded set. One of the most popular algorithms for solving this problem is the celebrated gradient descent ascent (GDA) algorithm, which has been widely used in machine learning, control theory and economics. Despite the extensive convergence results for the convex-concave setting, GDA with equal stepsize can converge to limit cycles or even diverge in a general setting. In this paper, we present the complexity results on two-time-scale GDA for solving nonconvex-concave minimax problems, showing that the algorithm can find a stationary point of the function efficiently. To the best our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research
