Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex   Minimax Machine Learning

Ziyi Chen; Shaocong Ma; Yi Zhou

arXiv:2112.11663·cs.LG·May 23, 2022

Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning

Ziyi Chen, Shaocong Ma, Yi Zhou

PDF

Open Access

TL;DR

This paper introduces a fast, single-loop proximal gradient algorithm with momentum acceleration for nonconvex minimax problems, improving convergence efficiency over existing methods and demonstrating effectiveness in adversarial deep learning.

Contribution

It develops a novel accelerated AltGDA-type algorithm with proven convergence and better complexity bounds for nonconvex minimax optimization.

Findings

01

Achieves convergence to critical points in nonconvex minimax problems.

02

Reduces computational complexity compared to existing AltGDA algorithms.

03

Demonstrates effectiveness in adversarial deep learning experiments.

Abstract

Alternating gradient-descent-ascent (AltGDA) is an optimization algorithm that has been widely used for model training in various machine learning applications, which aims to solve a nonconvex minimax optimization problem. However, the existing studies show that it suffers from a high computation complexity in nonconvex minimax optimization. In this paper, we develop a single-loop and fast AltGDA-type algorithm that leverages proximal gradient updates and momentum acceleration to solve regularized nonconvex minimax optimization problems. By leveraging the momentum acceleration technique, we prove that the algorithm converges to a critical point in nonconvex minimax optimization and achieves a computation complexity in the order of $O (κ^{\frac{11}{6}} ϵ^{- 2})$ , where $ϵ$ is the desired level of accuracy and $κ$ is the problem's condition number. {Such a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Adversarial Robustness in Machine Learning