Shuffling Gradient-Based Methods for Nonconvex-Concave Minimax Optimization
Quoc Tran-Dinh, Trang H. Tran, and Lam M. Nguyen

TL;DR
This paper introduces novel shuffling gradient-based algorithms for nonconvex minimax problems, achieving state-of-the-art complexity bounds and demonstrating competitive performance with SGD through numerical experiments.
Contribution
The paper develops new shuffling gradient methods for nonconvex minimax problems, providing the first complexity bounds for these settings and introducing a new hyper-gradient shuffling estimator.
Findings
Algorithms achieve state-of-the-art oracle complexity.
Numerical results show competitive performance with SGD.
New shuffling estimator improves hyper-gradient estimation.
Abstract
This paper aims at developing novel shuffling gradient-based methods for tackling two classes of minimax problems: nonconvex-linear and nonconvex-strongly concave settings. The first algorithm addresses the nonconvex-linear minimax model and achieves the state-of-the-art oracle complexity typically observed in nonconvex optimization. It also employs a new shuffling estimator for the "hyper-gradient", departing from standard shuffling techniques in optimization. The second method consists of two variants: semi-shuffling and full-shuffling schemes. These variants tackle the nonconvex-strongly concave minimax setting. We establish their oracle complexity bounds under standard assumptions, which, to our best knowledge, are the best-known for this specific setting. Numerical examples demonstrate the performance of our algorithms and compare them with two other methods. Our results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
MethodsStochastic Gradient Descent
