A Stochastic Gradient Method with an Exponential Convergence Rate for   Finite Training Sets

Nicolas Le Roux (INRIA Paris - Rocquencourt; LIENS); Mark Schmidt; (INRIA Paris - Rocquencourt; LIENS); Francis Bach (INRIA Paris -; Rocquencourt; LIENS)

arXiv:1202.6258·math.OC·March 12, 2013·540 cites

A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets

Nicolas Le Roux (INRIA Paris - Rocquencourt, LIENS), Mark Schmidt, (INRIA Paris - Rocquencourt, LIENS), Francis Bach (INRIA Paris -, Rocquencourt, LIENS)

PDF

Open Access

TL;DR

This paper introduces a stochastic gradient method with memory that achieves exponential convergence for optimizing strongly convex finite sums, significantly outperforming standard methods in training and test error reduction.

Contribution

The paper presents a novel stochastic gradient algorithm that attains linear convergence by incorporating gradient memory, improving over traditional sublinear methods.

Findings

01

Achieves exponential convergence rate for strongly convex finite sums.

02

Outperforms standard stochastic gradient methods in experiments.

03

Reduces training and test errors more rapidly.

Abstract

We propose a new stochastic gradient method for optimizing the sum of a finite set of smooth functions, where the sum is strongly convex. While standard stochastic gradient methods converge at sublinear rates for this problem, the proposed method incorporates a memory of previous gradient values in order to achieve a linear convergence rate. In a machine learning context, numerical experiments indicate that the new algorithm can dramatically outperform standard algorithms, both in terms of optimizing the training error and reducing the test error quickly.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research