Faster Gradient-Free Proximal Stochastic Methods for Nonconvex Nonsmooth Optimization
Feihu Huang, Bin Gu, Zhouyuan Huo, Songcan Chen, Heng Huang

TL;DR
This paper introduces faster zeroth-order proximal stochastic algorithms with variance reduction techniques for nonconvex nonsmooth optimization, achieving improved convergence rates over previous methods.
Contribution
The paper develops ZO-ProxSVRG and ZO-ProxSAGA algorithms with $O(1/T)$ convergence rates, addressing the unbiased gradient estimation challenge in zeroth-order methods.
Findings
Algorithms outperform existing zeroth-order methods in convergence speed.
Theoretical proof of $O(1/T)$ convergence rate.
Experimental results confirm faster convergence.
Abstract
Proximal gradient method has been playing an important role to solve many machine learning tasks, especially for the nonsmooth problems. However, in some machine learning problems such as the bandit model and the black-box learning problem, proximal gradient method could fail because the explicit gradients of these problems are difficult or infeasible to obtain. The gradient-free (zeroth-order) method can address these problems because only the objective function values are required in the optimization. Recently, the first zeroth-order proximal stochastic algorithm was proposed to solve the nonconvex nonsmooth problems. However, its convergence rate is for the nonconvex problems, which is significantly slower than the best convergence rate of the zeroth-order stochastic algorithm, where is the iteration number. To fill this gap, in the paper,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
MethodsSAGA
