Decentralized Riemannian Algorithm for Nonconvex Minimax Problems
Xidong Wu, Zhengmian Hu, Heng Huang

TL;DR
This paper introduces the first decentralized algorithms for nonconvex minimax problems on Riemannian manifolds, specifically the Stiefel manifold, with theoretical convergence guarantees and empirical validation in neural network training.
Contribution
It develops deterministic and stochastic decentralized minimax algorithms for nonconvex Riemannian problems, providing the first convergence guarantees in this setting.
Findings
Deterministic method achieves $O( \\epsilon^{-2})$ gradient complexity.
Stochastic method achieves $O( \\epsilon^{-4})$ gradient complexity.
Algorithms demonstrate efficiency in neural network training over the Stiefel manifold.
Abstract
The minimax optimization over Riemannian manifolds (possibly nonconvex constraints) has been actively applied to solve many problems, such as robust dimensionality reduction and deep neural networks with orthogonal weights (Stiefel manifold). Although many optimization algorithms for minimax problems have been developed in the Euclidean setting, it is difficult to convert them into Riemannian cases, and algorithms for nonconvex minimax problems with nonconvex constraints are even rare. On the other hand, to address the big data challenges, decentralized (serverless) training techniques have recently been emerging since they can reduce communications overhead and avoid the bottleneck problem on the server node. Nonetheless, the algorithm for decentralized Riemannian minimax problems has not been studied. In this paper, we study the distributed nonconvex-strongly-concave minimax…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
