Gradient Free Minimax Optimization: Variance Reduction and Faster Convergence
Tengyu Xu, Zhe Wang, Yingbin Liang, H. Vincent Poor

TL;DR
This paper introduces a novel zeroth-order variance reduction algorithm for nonconvex-strongly-concave minimax problems, achieving the best known query complexity and outperforming previous methods, with demonstrated effectiveness in black-box robust optimization.
Contribution
It presents the first zeroth-order minimax optimization method with variance reduction that attains optimal complexity bounds without relying on diminishing stepsizes.
Findings
Achieves query complexity of O(κ(d_1 + d_2)ε^{-3})
Outperforms all previous complexity bounds by orders of magnitude
Demonstrates superior performance in black-box distributional robust optimization
Abstract
Many important machine learning applications amount to solving minimax optimization problems, and in many cases there is no access to the gradient information, but only the function values. In this paper, we focus on such a gradient-free setting, and consider the nonconvex-strongly-concave minimax stochastic optimization problem. In the literature, various zeroth-order (i.e., gradient-free) minimax methods have been proposed, but none of them achieve the potentially feasible computational complexity of suggested by the stochastic nonconvex minimization theorem. In this paper, we adopt the variance reduction technique to design a novel zeroth-order variance reduced gradient descent ascent (ZO-VRGDA) algorithm. We show that the ZO-VRGDA algorithm achieves the best known query complexity of , which outperforms all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs
