Random Search Hyper-Parameter Tuning: Expected Improvement Estimation and the Corresponding Lower Bound
Dan Navon, Alex M. Bronstein

TL;DR
This paper introduces a method to estimate the expected accuracy improvement from additional hyperparameter search iterations, providing bounds and aiding in more efficient hyperparameter tuning for neural networks.
Contribution
It establishes the first bound on the expected gain from an extra hyperparameter search iteration, applicable to random search methods, with a practical estimation approach.
Findings
Provides an empirical estimate for expected accuracy improvement
Bounds the estimate with an error of $O(rac{ ootlogk}{k})$ with high probability
Demonstrates the optimal estimate error decreases as $rac{1}{k}$
Abstract
Hyperparameter tuning is a common technique for improving the performance of neural networks. Most techniques for hyperparameter search involve an iterated process where the model is retrained at every iteration. However, the expected accuracy improvement from every additional search iteration, is still unknown. Calculating the expected improvement can help create stopping rules for hyperparameter tuning and allow for a wiser allocation of a project's computational budget. In this paper, we establish an empirical estimate for the expected accuracy improvement from an additional iteration of hyperparameter search. Our results hold for any hyperparameter tuning method which is based on random search \cite{bergstra2012random} and samples hyperparameters from a fixed distribution. We bound our estimate with an error of w.h.p. where is the current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Machine Learning and Algorithms
MethodsRandom Search
