Backtracking Gradient Descent allowing unbounded learning rates
Tuyen Trung Truong

TL;DR
This paper extends convergence analysis of Gradient Descent by allowing unbounded learning rates through a backtracking method, potentially leading to better minima in unconstrained optimization.
Contribution
It introduces a novel unbounded backtracking approach for learning rates in Gradient Descent, proving convergence under general conditions and demonstrating its optimal growth rate.
Findings
Unbounded learning rates can improve convergence to better minima.
The proposed method generalizes previous backtracking algorithms.
The growth rate of the unbounded learning rates is shown to be optimal.
Abstract
In unconstrained optimisation on an Euclidean space, to prove convergence in Gradient Descent processes (GD) it usually is required that the learning rates 's are bounded: for some positive . Under this assumption, if the sequence converges to a critical point , then with large values of the update will be small because . This may also force the sequence to converge to a bad minimum. If we can allow, at least theoretically, that the learning rates 's are not bounded, then we may have better convergence to better minima. A previous joint paper by the author showed convergence for the usual version of Backtracking GD under very general assumptions on the cost function . In this paper, we allow the learning rates to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques
