Efficient Hyperparameter Optimization of Deep Learning Algorithms Using Deterministic RBF Surrogates
Ilija Ilievski, Taimoor Akhtar, Jiashi Feng, Christine, Annette Shoemaker

TL;DR
This paper introduces HORD, a deterministic radial basis function-based method for hyperparameter optimization in deep learning, which is more efficient than Bayesian approaches, especially in high-dimensional settings.
Contribution
The paper presents HORD, a novel deterministic surrogate-based optimization algorithm that outperforms Bayesian methods in hyperparameter tuning for deep neural networks.
Findings
HORD is over 6 times faster than GP-EI in hyperparameter optimization.
HORD performs exceptionally well in high-dimensional hyperparameter spaces.
Extensive experiments on MNIST and CIFAR-10 validate HORD's efficiency and effectiveness.
Abstract
Automatically searching for optimal hyperparameter configurations is of crucial importance for applying deep learning algorithms in practice. Recently, Bayesian optimization has been proposed for optimizing hyperparameters of various machine learning algorithms. Those methods adopt probabilistic surrogate models like Gaussian processes to approximate and minimize the validation error function of hyperparameter values. However, probabilistic surrogates require accurate estimates of sufficient statistics (e.g., covariance) of the error distribution and thus need many function evaluations with a sizeable number of hyperparameters. This makes them inefficient for optimizing hyperparameters of deep learning algorithms, which are highly expensive to evaluate. In this work, we propose a new deterministic and efficient hyperparameter optimization method that employs radial basis functions as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference
