Backtracking Gradient Descent allowing unbounded learning rates

Tuyen Trung Truong

arXiv:2001.02005·math.OC·January 9, 2020·6 cites

Backtracking Gradient Descent allowing unbounded learning rates

Tuyen Trung Truong

PDF

Open Access

TL;DR

This paper extends convergence analysis of Gradient Descent by allowing unbounded learning rates through a backtracking method, potentially leading to better minima in unconstrained optimization.

Contribution

It introduces a novel unbounded backtracking approach for learning rates in Gradient Descent, proving convergence under general conditions and demonstrating its optimal growth rate.

Findings

01

Unbounded learning rates can improve convergence to better minima.

02

The proposed method generalizes previous backtracking algorithms.

03

The growth rate of the unbounded learning rates is shown to be optimal.

Abstract

In unconstrained optimisation on an Euclidean space, to prove convergence in Gradient Descent processes (GD) $x_{n + 1} = x_{n} - δ_{n} \nabla f (x_{n})$ it usually is required that the learning rates $δ_{n}$ 's are bounded: $δ_{n} \leq δ$ for some positive $δ$ . Under this assumption, if the sequence $x_{n}$ converges to a critical point $z$ , then with large values of $n$ the update will be small because $∣∣ x_{n + 1} - x_{n} ∣∣ ≲ ∣∣\nabla f (x_{n}) ∣∣$ . This may also force the sequence to converge to a bad minimum. If we can allow, at least theoretically, that the learning rates $δ_{n}$ 's are not bounded, then we may have better convergence to better minima. A previous joint paper by the author showed convergence for the usual version of Backtracking GD under very general assumptions on the cost function $f$ . In this paper, we allow the learning rates $δ_{n}$ to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques