Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning
Ziyi Chen, Shaocong Ma, Yi Zhou

TL;DR
This paper introduces a fast, single-loop proximal gradient algorithm with momentum acceleration for nonconvex minimax problems, improving convergence efficiency over existing methods and demonstrating effectiveness in adversarial deep learning.
Contribution
It develops a novel accelerated AltGDA-type algorithm with proven convergence and better complexity bounds for nonconvex minimax optimization.
Findings
Achieves convergence to critical points in nonconvex minimax problems.
Reduces computational complexity compared to existing AltGDA algorithms.
Demonstrates effectiveness in adversarial deep learning experiments.
Abstract
Alternating gradient-descent-ascent (AltGDA) is an optimization algorithm that has been widely used for model training in various machine learning applications, which aims to solve a nonconvex minimax optimization problem. However, the existing studies show that it suffers from a high computation complexity in nonconvex minimax optimization. In this paper, we develop a single-loop and fast AltGDA-type algorithm that leverages proximal gradient updates and momentum acceleration to solve regularized nonconvex minimax optimization problems. By leveraging the momentum acceleration technique, we prove that the algorithm converges to a critical point in nonconvex minimax optimization and achieves a computation complexity in the order of , where is the desired level of accuracy and is the problem's condition number. {Such a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Adversarial Robustness in Machine Learning
