A Simple Proximal Stochastic Gradient Method for Nonsmooth Nonconvex Optimization
Zhize Li, Jian Li

TL;DR
This paper introduces ProxSVRG+, a variance-reduced proximal stochastic gradient method for nonsmooth nonconvex optimization, which improves convergence guarantees, reduces oracle calls, and adapts to local PL conditions, outperforming existing algorithms.
Contribution
ProxSVRG+ is a novel variance reduction algorithm that generalizes and improves upon existing proximal stochastic gradient methods for nonconvex nonsmooth problems.
Findings
ProxSVRG+ outperforms ProxGD and ProxSVRG in experiments.
ProxSVRG+ achieves linear convergence under Polyak-ojasiewicz condition.
The algorithm reduces the number of proximal oracle calls compared to prior methods.
Abstract
We analyze stochastic gradient algorithms for optimizing nonconvex, nonsmooth finite-sum problems. In particular, the objective function is given by the summation of a differentiable (possibly nonconvex) component, together with a possibly non-differentiable but convex component. We propose a proximal stochastic gradient algorithm based on variance reduction, called ProxSVRG+. Our main contribution lies in the analysis of ProxSVRG+. It recovers several existing convergence results and improves/generalizes them (in terms of the number of stochastic gradient oracle calls and proximal oracle calls). In particular, ProxSVRG+ generalizes the best results given by the SCSG algorithm, recently proposed by [Lei et al., 2017] for the smooth nonconvex case. ProxSVRG+ is also more straightforward than SCSG and yields simpler analysis. Moreover, ProxSVRG+ outperforms the deterministic proximal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
