Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning
Jian Wu, Saul Toscano-Palmerin, Peter I. Frazier, Andrew Gordon, Wilson

TL;DR
This paper introduces a practical multi-fidelity Bayesian optimization method with a novel acquisition function, significantly improving hyperparameter tuning efficiency for deep neural networks and large-scale kernel learning.
Contribution
It presents a flexible approach with a new trace-aware knowledge-gradient acquisition function, optimized for efficiently tuning hyperparameters using multiple fidelity controls.
Findings
Outperforms existing methods in neural network hyperparameter tuning
Efficiently leverages multiple fidelity levels and trace observations
Provides a convergent optimization method for the acquisition function
Abstract
Bayesian optimization is popular for optimizing time-consuming black-box objectives. Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck. Multi-fidelity optimization promises relief using cheaper proxies to such objectives --- for example, validation error for a network trained using a subset of the training points or fewer iterations than required for convergence. We propose a highly flexible and practical approach to multi-fidelity Bayesian optimization, focused on efficiently optimizing hyperparameters for iteratively trained supervised learning models. We introduce a new acquisition function, the trace-aware knowledge-gradient, which efficiently leverages both multiple continuous fidelity controls and trace observations --- values of the objective at a sequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Heat Transfer and Optimization · Advanced Sensor Technologies Research
