AGDA+: Proximal Alternating Gradient Descent Ascent Method with a Nonmonotone Adaptive Step-Size Search for Nonconvex Minimax Problems
Xuan Zhang, Qiushui Xu, and Necdet Serhat Aybat

TL;DR
This paper introduces AGDA+, an adaptive proximal gradient method for nonconvex-strongly concave minimax problems that automatically adjusts step sizes, achieving optimal complexity without prior parameter knowledge.
Contribution
AGDA+ is the first method with a nonmonotone step-size search for NCSC minimax problems, eliminating the need for tuning and exploiting local Lipschitz structure.
Findings
Achieves optimal $ ilde{O}(rac{1}{ ext{epsilon}^2})$ iteration complexity.
Requires only 3 gradient calls per backtracking iteration on average.
Demonstrates robustness and efficiency in numerical experiments.
Abstract
We consider double-regularized nonconvex-strongly concave (NCSC) minimax problems of the form , where , are closed convex, is -smooth in and strongly concave in . We propose a proximal alternating gradient descent ascent method AGDA+ that can adaptively choose nonmonotone primal-dual stepsizes to compute an approximate stationary point for without requiring the knowledge of the global Lipschitz constant and the concavity modulus . Using a nonmonotone step-size search (backtracking) scheme, AGDA+ stands out by its ability to exploit the local Lipschitz structure and eliminates the need for precise tuning of hyper-parameters. AGDA+ achieves the optimal iteration complexity of and it is the first step-size search method for NCSC minimax problems that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
