Probabilistic Line Searches for Stochastic Optimization

Maren Mahsereci; Philipp Hennig

arXiv:1502.02846·cs.LG·January 19, 2016·25 cites

Probabilistic Line Searches for Stochastic Optimization

Maren Mahsereci, Philipp Hennig

PDF

Open Access 1 Repo

TL;DR

This paper introduces a probabilistic line search method for stochastic optimization that uses Bayesian techniques to adaptively determine step sizes, removing the need for manual learning rate tuning.

Contribution

It develops a novel probabilistic line search algorithm combining Gaussian process surrogates with Wolfe condition beliefs, applicable to stochastic gradient descent.

Findings

01

Effectively removes the need for manual learning rate tuning.

02

Maintains low computational cost and no user parameters.

03

Improves stability and efficiency in stochastic optimization.

Abstract

In deterministic optimization, line searches are a standard tool ensuring stability and efficiency. Where only stochastic gradients are available, no direct equivalent has so far been formulated, because uncertain gradients do not allow for a strict sequence of decisions collapsing the search space. We construct a probabilistic line search by combining the structure of existing deterministic methods with notions from Bayesian optimization. Our method retains a Gaussian process surrogate of the univariate optimization objective, and uses a probabilistic belief over the Wolfe conditions to monitor the descent. The algorithm has very low computational cost, and no user-controlled parameters. Experiments show that it effectively removes the need to define a learning rate for stochastic gradient descent.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ProbabilisticNumerics/probabilistic_line_search
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Advanced Bandit Algorithms Research

MethodsGaussian Process