Faster Gradient-Free Proximal Stochastic Methods for Nonconvex Nonsmooth   Optimization

Feihu Huang; Bin Gu; Zhouyuan Huo; Songcan Chen; Heng Huang

arXiv:1902.06158·math.OC·February 19, 2019·1 cites

Faster Gradient-Free Proximal Stochastic Methods for Nonconvex Nonsmooth Optimization

Feihu Huang, Bin Gu, Zhouyuan Huo, Songcan Chen, Heng Huang

PDF

Open Access

TL;DR

This paper introduces faster zeroth-order proximal stochastic algorithms with variance reduction techniques for nonconvex nonsmooth optimization, achieving improved convergence rates over previous methods.

Contribution

The paper develops ZO-ProxSVRG and ZO-ProxSAGA algorithms with $O(1/T)$ convergence rates, addressing the unbiased gradient estimation challenge in zeroth-order methods.

Findings

01

Algorithms outperform existing zeroth-order methods in convergence speed.

02

Theoretical proof of $O(1/T)$ convergence rate.

03

Experimental results confirm faster convergence.

Abstract

Proximal gradient method has been playing an important role to solve many machine learning tasks, especially for the nonsmooth problems. However, in some machine learning problems such as the bandit model and the black-box learning problem, proximal gradient method could fail because the explicit gradients of these problems are difficult or infeasible to obtain. The gradient-free (zeroth-order) method can address these problems because only the objective function values are required in the optimization. Recently, the first zeroth-order proximal stochastic algorithm was proposed to solve the nonconvex nonsmooth problems. However, its convergence rate is $O (\frac{1}{T})$ for the nonconvex problems, which is significantly slower than the best convergence rate $O (\frac{1}{T})$ of the zeroth-order stochastic algorithm, where $T$ is the iteration number. To fill this gap, in the paper,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research

MethodsSAGA