Fast Hyperparameter Optimization of Deep Neural Networks via Ensembling Multiple Surrogates
Yang Li, Jiawei Jiang, Yingxia Shao, Bin Cui

TL;DR
This paper introduces HOIST, a hyperparameter optimization method that leverages both complete and intermediate evaluation data using ensemble surrogates to improve efficiency in tuning deep neural networks.
Contribution
HOIST is a novel approach that combines multiple surrogates trained on mixed evaluation data, including early stopping information, to accelerate DNN hyperparameter tuning.
Findings
HOIST outperforms state-of-the-art methods across various DNN architectures.
Utilizes both complete and intermediate evaluation data for better surrogate modeling.
Ensemble surrogates improve the accuracy and efficiency of hyperparameter optimization.
Abstract
The performance of deep neural networks crucially depends on good hyperparameter configurations. Bayesian optimization is a powerful framework for optimizing the hyperparameters of DNNs. These methods need sufficient evaluation data to approximate and minimize the validation error function of hyperparameters. However, the expensive evaluation cost of DNNs leads to very few evaluation data within a limited time, which greatly reduces the efficiency of Bayesian optimization. Besides, the previous researches focus on using the complete evaluation data to conduct Bayesian optimization, and ignore the intermediate evaluation data generated by early stopping methods. To alleviate the insufficient evaluation data problem, we propose a fast hyperparameter optimization method, HOIST, that utilizes both the complete and intermediate evaluation data to accelerate the hyperparameter optimization of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Advanced Multi-Objective Optimization Algorithms
MethodsEarly Stopping
