Linear Convergence of Accelerated Stochastic Gradient Descent for Nonconvex Nonsmooth Optimization

Feihu Huang; Songcan Chen

arXiv:1704.07953 · math.OC · February 18, 2019 · 2 cites

Open Access

TL;DR

This paper introduces an accelerated stochastic gradient descent method combining variance reduction and Nesterov's extrapolation, proving its linear convergence to stationary points in nonconvex nonsmooth optimization, supported by numerical experiments.

Contribution

To the authors' knowledge, this is the first proof that an accelerated SGD method converges linearly to a local minimum of a nonconvex nonsmooth problem.

Findings

01  Proved linear convergence of the proposed method.

02  Demonstrated effectiveness through numerical experiments.

03  Established convergence to stationary points.

Abstract

In this paper, we study the stochastic gradient descent (SGD) method for nonconvex nonsmooth optimization, and propose an accelerated SGD method that combines the variance reduction technique with Nesterov's extrapolation technique. Moreover, based on the local error bound condition, we establish the linear convergence of our method to a stationary point of the nonconvex problem. In particular, we prove not only that the generated sequence of iterates converges linearly to a stationary point of the problem, but also that the corresponding sequence of objective values converges linearly. Finally, numerical experiments demonstrate the effectiveness of our method. To the best of our knowledge, this is the first proof that an accelerated SGD method converges linearly to a local minimum of a nonconvex optimization problem.
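The combination described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm as stated in the paper: it is a generic proximal SVRG loop with a fixed Nesterov-style extrapolation weight, written under several assumptions. The Cauchy-type loss (smooth and nonconvex), the l1 penalty (nonsmooth), the step size eta, the momentum beta, and every function name below are hypothetical choices made only for illustration.

```python
import numpy as np


def soft_threshold(z, tau):
    """Proximal operator of tau * ||.||_1 (handles the nonsmooth term)."""
    return np.sign(z) * np.maximum(np.abs(z) - tau, 0.0)


def objective(A, b, x, lam):
    """Mean Cauchy-type loss (smooth, nonconvex) plus an l1 penalty."""
    r = A @ x - b
    return np.mean(np.log1p(0.5 * r ** 2)) + lam * np.sum(np.abs(x))


def grad_rows(A, b, x, idx):
    """Average gradient of the smooth loss over the rows listed in idx."""
    r = A[idx] @ x - b[idx]
    w = r / (1.0 + 0.5 * r ** 2)  # derivative of log(1 + r^2 / 2) w.r.t. r
    return A[idx].T @ w / len(idx)


def make_problem(n=200, d=20, seed=0):
    """Synthetic sparse regression data (an assumption, not from the paper)."""
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((n, d))
    x_true = np.zeros(d)
    x_true[:5] = 1.0
    b = A @ x_true + 0.1 * rng.standard_normal(n)
    return A, b


def accelerated_prox_svrg(A, b, lam=0.01, eta=0.01, beta=0.2, epochs=30, seed=0):
    """Proximal SVRG with an extrapolation step; returns (x, objective history)."""
    rng = np.random.default_rng(seed)
    n, d = A.shape
    all_rows = np.arange(n)
    x = np.zeros(d)
    x_prev = x.copy()
    history = [objective(A, b, x, lam)]
    for _ in range(epochs):
        # Variance reduction: full gradient at a snapshot, once per epoch.
        snapshot = x.copy()
        full_grad = grad_rows(A, b, snapshot, all_rows)
        for _ in range(n):
            # Nesterov-style extrapolation from the last two iterates.
            y = x + beta * (x - x_prev)
            i = rng.integers(n, size=1)
            # Variance-reduced stochastic gradient evaluated at y.
            v = grad_rows(A, b, y, i) - grad_rows(A, b, snapshot, i) + full_grad
            x_prev = x
            # Proximal step for the nonsmooth l1 term.
            x = soft_threshold(y - eta * v, eta * lam)
        history.append(objective(A, b, x, lam))
    return x, history


if __name__ == "__main__":
    A, b = make_problem()
    x, history = accelerated_prox_svrg(A, b)
    print("initial objective:", history[0])
    print("final objective:", history[-1])
```

Plotting the logarithm of the objective gap against the epoch index is one way to look for a linear rate empirically: a roughly straight, downward-sloping line is consistent with linear convergence, although it does not prove it.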

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods

Methods: Stochastic Gradient Descent