Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax   Problems

Feihu Huang; Songcan Chen

arXiv:2304.10902·math.OC·April 24, 2023·1 cites

Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax Problems

Feihu Huang, Songcan Chen

PDF

Open Access

TL;DR

This paper introduces a decentralized momentum-based gradient method for nonconvex-PL stochastic minimax problems, achieving near-optimal convergence rates in distributed machine learning scenarios.

Contribution

It proposes the first decentralized algorithm for nonconvex-PL stochastic minimax problems with theoretical convergence guarantees.

Findings

01

Achieves near-optimal gradient complexity of O(ε^{-3})

02

First to study decentralized algorithms for nonconvex-PL minimax problems

03

Convergence analysis confirms efficiency in distributed settings

Abstract

Minimax optimization plays an important role in many machine learning tasks such as generative adversarial networks (GANs) and adversarial training. Although recently a wide variety of optimization methods have been proposed to solve the minimax problems, most of them ignore the distributed setting where the data is distributed on multiple workers. Meanwhile, the existing decentralized minimax optimization methods rely on the strictly assumptions such as (strongly) concavity and variational inequality conditions. In the paper, thus, we propose an efficient decentralized momentum-based gradient descent ascent (DM-GDA) method for the distributed nonconvex-PL minimax optimization, which is nonconvex in primal variable and is nonconcave in dual variable and satisfies the Polyak-Lojasiewicz (PL) condition. In particular, our DM-GDA method simultaneously uses the momentum-based techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques