Gradient Free Minimax Optimization: Variance Reduction and Faster   Convergence

Tengyu Xu; Zhe Wang; Yingbin Liang; H. Vincent Poor

arXiv:2006.09361·cs.LG·March 23, 2021·5 cites

Gradient Free Minimax Optimization: Variance Reduction and Faster Convergence

Tengyu Xu, Zhe Wang, Yingbin Liang, H. Vincent Poor

PDF

Open Access

TL;DR

This paper introduces a novel zeroth-order variance reduction algorithm for nonconvex-strongly-concave minimax problems, achieving the best known query complexity and outperforming previous methods, with demonstrated effectiveness in black-box robust optimization.

Contribution

It presents the first zeroth-order minimax optimization method with variance reduction that attains optimal complexity bounds without relying on diminishing stepsizes.

Findings

01

Achieves query complexity of O(κ(d_1 + d_2)ε^{-3})

02

Outperforms all previous complexity bounds by orders of magnitude

03

Demonstrates superior performance in black-box distributional robust optimization

Abstract

Many important machine learning applications amount to solving minimax optimization problems, and in many cases there is no access to the gradient information, but only the function values. In this paper, we focus on such a gradient-free setting, and consider the nonconvex-strongly-concave minimax stochastic optimization problem. In the literature, various zeroth-order (i.e., gradient-free) minimax methods have been proposed, but none of them achieve the potentially feasible computational complexity of $O (ϵ^{- 3})$ suggested by the stochastic nonconvex minimization theorem. In this paper, we adopt the variance reduction technique to design a novel zeroth-order variance reduced gradient descent ascent (ZO-VRGDA) algorithm. We show that the ZO-VRGDA algorithm achieves the best known query complexity of $O (κ (d_{1} + d_{2}) ϵ^{- 3})$ , which outperforms all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs