Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient
Hao Di, Haishan Ye, Yueling Zhang, Xiangyu Chang, Guang Dai, Ivor W., Tsang

TL;DR
This paper introduces ZPDVR, a zeroth-order optimization method that reduces variance efficiently without estimating all partial derivatives, achieving optimal query complexity and superior empirical performance.
Contribution
The paper proposes ZPDVR, a novel variance reduction technique for zeroth-order composite optimization that avoids high-dimensional derivative estimation and achieves optimal convergence rates.
Findings
ZPDVR attains the optimal $ ilde{O}(d(n + au) ext{log}(1/\epsilon))$ query complexity.
Empirical results show ZPDVR's linear convergence and superior performance.
ZPDVR reduces both sampling and coordinate-wise variances using an averaging trick.
Abstract
Variance reduction techniques are designed to decrease the sampling variance, thereby accelerating convergence rates of first-order (FO) and zeroth-order (ZO) optimization methods. However, in composite optimization problems, ZO methods encounter an additional variance called the coordinate-wise variance, which stems from the random gradient estimation. To reduce this variance, prior works require estimating all partial derivatives, essentially approximating FO information. This approach demands O(d) function evaluations (d is the dimension size), which incurs substantial computational costs and is prohibitive in high-dimensional scenarios. This paper proposes the Zeroth-order Proximal Double Variance Reduction (ZPDVR) method, which utilizes the averaging trick to reduce both sampling and coordinate-wise variances. Compared to prior methods, ZPDVR relies solely on random gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Radiative Heat Transfer Studies · Sparse and Compressive Sensing Techniques
