BOHB: Robust and Efficient Hyperparameter Optimization at Scale

Stefan Falkner; Aaron Klein; Frank Hutter

arXiv:1807.01774·cs.LG·July 6, 2018·180 cites

BOHB: Robust and Efficient Hyperparameter Optimization at Scale

Stefan Falkner, Aaron Klein, Frank Hutter

PDF

Open Access 4 Repos

TL;DR

BOHB combines Bayesian optimization and bandit-based methods to deliver robust, efficient, and scalable hyperparameter tuning, outperforming existing approaches across diverse machine learning tasks.

Contribution

The paper introduces BOHB, a novel hyperparameter optimization method that merges Bayesian and bandit strategies for improved performance and scalability.

Findings

01

BOHB outperforms Bayesian optimization and Hyperband on various tasks.

02

BOHB is robust and versatile across different models and problem types.

03

BOHB is simple to implement and computationally efficient.

Abstract

Modern deep learning methods are very sensitive to many hyperparameters, and, due to the long training times of state-of-the-art models, vanilla Bayesian hyperparameter optimization is typically computationally infeasible. On the other hand, bandit-based configuration evaluation approaches based on random search lack guidance and do not converge to the best configurations as quickly. Here, we propose to combine the benefits of both Bayesian optimization and bandit-based methods, in order to achieve the best of both worlds: strong anytime performance and fast convergence to optimal configurations. We propose a new practical state-of-the-art hyperparameter optimization method, which consistently outperforms both Bayesian optimization and Hyperband on a wide range of problem types, including high-dimensional toy functions, support vector machines, feed-forward neural networks, Bayesian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Advanced Bandit Algorithms Research