Stochastic Recursive Gradient Algorithm for Nonconvex Optimization

Lam M. Nguyen; Jie Liu; Katya Scheinberg; Martin Tak\'a\v{c}

arXiv:1705.07261·stat.ML·May 23, 2017·67 cites

Stochastic Recursive Gradient Algorithm for Nonconvex Optimization

Lam M. Nguyen, Jie Liu, Katya Scheinberg, Martin Tak\'a\v{c}

PDF

Open Access

TL;DR

This paper introduces a mini-batch stochastic recursive gradient algorithm called SARAH for nonconvex optimization, demonstrating improved convergence rates for general and gradient dominated functions.

Contribution

It provides the first analysis of SARAH's convergence rates for nonconvex loss minimization, including sublinear and linear rates under different conditions.

Findings

01

Sublinear convergence rate to stationary points for general nonconvex functions.

02

Linear convergence rate for gradient dominated functions.

03

Advantages over other stochastic gradient algorithms.

Abstract

In this paper, we study and analyze the mini-batch version of StochAstic Recursive grAdient algoritHm (SARAH), a method employing the stochastic recursive gradient, for solving empirical loss minimization for the case of nonconvex losses. We provide a sublinear convergence rate (to stationary points) for general nonconvex functions and a linear convergence rate for gradient dominated functions, both of which have some advantages compared to other modern stochastic gradient algorithms for nonconvex losses.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods