Decentralized Stochastic Gradient Descent Ascent for Finite-Sum Minimax   Problems

Hongchang Gao

arXiv:2212.02724·cs.LG·June 12, 2024·5 cites

Decentralized Stochastic Gradient Descent Ascent for Finite-Sum Minimax Problems

Hongchang Gao

PDF

Open Access

TL;DR

This paper introduces a novel decentralized stochastic gradient descent ascent method for finite-sum minimax problems, achieving new theoretical complexities and demonstrating effectiveness in AUC maximization.

Contribution

It develops the first decentralized stochastic method with variance reduction for finite-sum minimax problems, providing optimal theoretical complexity bounds.

Findings

01

Achieves $O(rac{ ext{sqrt}(n) ext{κ}^3}{(1- ext{λ})^2 ext{ε}^2})$ sample complexity.

02

Achieves $O(rac{ ext{κ}^3}{(1- ext{λ})^2 ext{ε}^2})$ communication complexity.

03

Experimental results confirm the method's effectiveness in AUC maximization.

Abstract

Minimax optimization problems have attracted significant attention in recent years due to their widespread application in numerous machine learning models. To solve the minimax problem, a wide variety of stochastic optimization methods have been proposed. However, most of them ignore the distributed setting where the training data is distributed on multiple workers. In this paper, we developed a novel decentralized stochastic gradient descent ascent method for the finite-sum minimax problem. In particular, by employing the variance-reduced gradient, our method can achieve $O (\frac{n κ ^{3}}{( 1 - λ ) ^{2} ϵ ^{2}})$ sample complexity and $O (\frac{κ ^{3}}{( 1 - λ ) ^{2} ϵ ^{2}})$ communication complexity for the nonconvex-strongly-concave minimax problem. As far as we know, our work is the first one to achieve such theoretical complexities for this kind of minimax problem.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques