Approximately Exact Line Search
Sara Fridovich-Keil, Benjamin Recht

TL;DR
The paper introduces approximately exact line search (AELS), a function evaluation-based method that accelerates convergence in various optimization algorithms, especially in stochastic settings, with proven bounds and empirical speedups.
Contribution
AELS is a novel line search method that guarantees near-optimal step sizes using only function evaluations, with proven convergence bounds and practical efficiency.
Findings
AELS achieves linear convergence on smooth, strongly convex functions.
AELS outperforms Armijo line searches in stochastic logistic regression.
Wolfe line search is slightly faster on derivative-free optimization benchmarks.
Abstract
We propose approximately exact line search (AELS), which uses only function evaluations to select a step size within a constant fraction of the exact line search minimizer of a unimodal objective. We bound the number of iterations and function evaluations of AELS, showing linear convergence on smooth, strongly convex objectives with no dependence on the initial step size for three descent methods: steepest descent using the true gradient, approximate steepest descent using a gradient approximation, and random direction search. We demonstrate experimental speedup compared to Armijo line searches and other baselines on weakly regularized logistic regression for both gradient descent and minibatch stochastic gradient descent and on a benchmark set of derivative-free optimization objectives using quasi-Newton search directions. We also analyze a simple line search for the strong Wolfe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
