Linearly convergent stochastic heavy ball method for minimizing   generalization error

Nicolas Loizou; Peter Richt\'arik

arXiv:1710.10737·math.OC·December 27, 2017·26 cites

Linearly convergent stochastic heavy ball method for minimizing generalization error

Nicolas Loizou, Peter Richt\'arik

PDF

Open Access

TL;DR

This paper proves the first linear convergence rate for a stochastic heavy ball method using fixed stepsize and momentum, specifically for minimizing expected loss in quadratic settings, expanding understanding of its efficiency.

Contribution

It introduces the first linear convergence analysis for the stochastic heavy ball method on expected loss minimization with fixed stepsize and momentum.

Findings

01

First linear convergence result established for stochastic heavy ball method.

02

Analysis focuses on expected loss, not finite-sum problems.

03

Applicable to quadratic loss, even if the overall objective isn't strongly convex.

Abstract

In this work we establish the first linear convergence result for the stochastic heavy ball method. The method performs SGD steps with a fixed stepsize, amended by a heavy ball momentum term. In the analysis, we focus on minimizing the expected loss and not on finite-sum minimization, which is typically a much harder problem. While in the analysis we constrain ourselves to quadratic loss, the overall objective is not necessarily strongly convex.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition

MethodsStochastic Gradient Descent