Decentralized Stochastic Gradient Descent Ascent for Finite-Sum Minimax Problems
Hongchang Gao

TL;DR
This paper introduces a novel decentralized stochastic gradient descent ascent method for finite-sum minimax problems, achieving new theoretical complexities and demonstrating effectiveness in AUC maximization.
Contribution
It develops the first decentralized stochastic method with variance reduction for finite-sum minimax problems, providing optimal theoretical complexity bounds.
Findings
Achieves $O(rac{ ext{sqrt}(n) ext{κ}^3}{(1- ext{λ})^2 ext{ε}^2})$ sample complexity.
Achieves $O(rac{ ext{κ}^3}{(1- ext{λ})^2 ext{ε}^2})$ communication complexity.
Experimental results confirm the method's effectiveness in AUC maximization.
Abstract
Minimax optimization problems have attracted significant attention in recent years due to their widespread application in numerous machine learning models. To solve the minimax problem, a wide variety of stochastic optimization methods have been proposed. However, most of them ignore the distributed setting where the training data is distributed on multiple workers. In this paper, we developed a novel decentralized stochastic gradient descent ascent method for the finite-sum minimax problem. In particular, by employing the variance-reduced gradient, our method can achieve sample complexity and communication complexity for the nonconvex-strongly-concave minimax problem. As far as we know, our work is the first one to achieve such theoretical complexities for this kind of minimax problem.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
