Freeze-Thaw Bayesian Optimization

Kevin Swersky; Jasper Snoek; Ryan Prescott Adams

arXiv:1406.3896·stat.ML·June 17, 2014·117 cites

Freeze-Thaw Bayesian Optimization

Kevin Swersky, Jasper Snoek, Ryan Prescott Adams

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dynamic Bayesian optimization method that leverages partial training information to efficiently select hyperparameters by deciding when to pause, resume, or restart training, improving hyperparameter tuning speed.

Contribution

It develops a novel covariance kernel and Gaussian process prior tailored for training curves, along with an information-theoretic framework for decision-making in hyperparameter optimization.

Findings

01

Effective in practice across multiple models

02

Reduces training time for hyperparameter tuning

03

Automates decision process with information theory

Abstract

In this paper we develop a dynamic form of Bayesian optimization for machine learning models with the goal of rapidly finding good hyperparameter settings. Our method uses the partial information gained during the training of a machine learning model in order to decide whether to pause training and start a new model, or resume the training of a previously-considered model. We specifically tailor our method to machine learning problems by developing a novel positive-definite covariance kernel to capture a variety of training curves. Furthermore, we develop a Gaussian process prior that scales gracefully with additional temporal observations. Finally, we provide an information-theoretic framework to automate the decision process. Experiments on several common machine learning models show that our approach is extremely effective in practice.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jamesrobertlloyd/automl-phase-2
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference

MethodsGaussian Process