AGDA+: Proximal Alternating Gradient Descent Ascent Method with a   Nonmonotone Adaptive Step-Size Search for Nonconvex Minimax Problems

Xuan Zhang; Qiushui Xu; and Necdet Serhat Aybat

arXiv:2406.14371·math.OC·January 30, 2025

AGDA+: Proximal Alternating Gradient Descent Ascent Method with a Nonmonotone Adaptive Step-Size Search for Nonconvex Minimax Problems

Xuan Zhang, Qiushui Xu, and Necdet Serhat Aybat

PDF

Open Access 1 Repo

TL;DR

This paper introduces AGDA+, an adaptive proximal gradient method for nonconvex-strongly concave minimax problems that automatically adjusts step sizes, achieving optimal complexity without prior parameter knowledge.

Contribution

AGDA+ is the first method with a nonmonotone step-size search for NCSC minimax problems, eliminating the need for tuning and exploiting local Lipschitz structure.

Findings

01

Achieves optimal $ ilde{O}(rac{1}{ ext{epsilon}^2})$ iteration complexity.

02

Requires only 3 gradient calls per backtracking iteration on average.

03

Demonstrates robustness and efficiency in numerical experiments.

Abstract

We consider double-regularized nonconvex-strongly concave (NCSC) minimax problems of the form $(P) : min_{x \in X} max_{y \in Y} g (x) + f (x, y) - h (y)$ , where $g$ , $h$ are closed convex, $f$ is $L$ -smooth in $(x, y)$ and strongly concave in $y$ . We propose a proximal alternating gradient descent ascent method AGDA+ that can adaptively choose nonmonotone primal-dual stepsizes to compute an approximate stationary point for $(P)$ without requiring the knowledge of the global Lipschitz constant $L$ and the concavity modulus $μ$ . Using a nonmonotone step-size search (backtracking) scheme, AGDA+ stands out by its ability to exploit the local Lipschitz structure and eliminates the need for precise tuning of hyper-parameters. AGDA+ achieves the optimal iteration complexity of $O (ϵ^{- 2})$ and it is the first step-size search method for NCSC minimax problems that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuanzhangg/agda-plus
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research