Adaptive Single-Loop Methods for Stochastic Minimax Optimization on Riemannian Manifolds
Hongye Wang, Chang He, Bo Jiang

TL;DR
This paper introduces adaptive single-loop algorithms for stochastic minimax optimization on Riemannian manifolds, eliminating the need for prior knowledge of problem parameters and achieving improved convergence rates.
Contribution
The paper develops the first single-loop adaptive methods for Riemannian minimax problems, with convergence guarantees and practical effectiveness demonstrated through experiments.
Findings
Deterministic method attains $ ext{O}(rac{1}{ extepsilon^2})$ convergence.
Stochastic method achieves $ ext{O}(rac{1}{ extepsilon^6})$ complexity, improved to $ ext{O}(rac{1}{ extepsilon^4})$ under smoothness.
Numerical experiments show the methods are effective in real-world applications.
Abstract
Stochastic minimax optimization on Riemannian manifolds has recently attracted significant attention due to its broad range of applications, such as robust training of neural networks and robust maximum likelihood estimation. Existing optimization methods for these problems typically require selecting stepsizes based on prior knowledge of specific problem parameters, such as Lipschitz-type constants and (geodesic) strong concavity constants. Unfortunately, these parameters are often unknown in practice. To overcome this issue, we develop single-loop adaptive methods that automatically adjust stepsizes using cumulative Riemannian (stochastic) gradient norms. We first propose a deterministic single-loop Riemannian adaptive gradient descent ascent method and show that it attains an -stationary point within iterations. This deterministic method is of independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks
