Fast Stochastic Methods for Nonsmooth Nonconvex Optimization
Sashank J. Reddi, Suvrit Sra, Barnabas Poczos, Alex Smola

TL;DR
This paper introduces fast stochastic algorithms for nonsmooth nonconvex optimization problems, proving convergence to stationary points and demonstrating faster convergence than batch methods, with linear convergence for certain subclasses.
Contribution
It develops the first provably convergent stochastic algorithms for nonsmooth nonconvex problems with constant minibatches, extending previous smooth nonconvex methods.
Findings
Algorithms converge to stationary points with constant minibatches.
Faster convergence than batch proximal gradient descent.
Global linear convergence for specific nonsmooth nonconvex functions.
Abstract
We analyze stochastic algorithms for optimizing nonconvex, nonsmooth finite-sum problems, where the nonconvex part is smooth and the nonsmooth part is convex. Surprisingly, unlike the smooth case, our knowledge of this fundamental problem is very limited. For example, it is not known whether the proximal stochastic gradient method with constant minibatch converges to a stationary point. To tackle this issue, we develop fast stochastic algorithms that provably converge to a stationary point for constant minibatches. Furthermore, using a variant of these algorithms, we show provably faster convergence than batch proximal gradient descent. Finally, we prove global linear convergence rate for an interesting subclass of nonsmooth nonconvex functions, that subsumes several recent works. This paper builds upon our recent series of papers on fast stochastic methods for smooth nonconvex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs
