Tabular Benchmarks for Joint Architecture and Hyperparameter Optimization
Aaron Klein, Frank Hutter

TL;DR
This paper introduces efficient, realistic benchmarks for evaluating hyperparameter optimization methods on neural networks, enabling reproducible and cost-effective empirical comparisons.
Contribution
It provides a set of neural network benchmarks with diverse configurations for systematic evaluation of HPO methods, including an analysis of hyperparameter importance and method performance.
Findings
Benchmarks are computationally cheap and realistic.
Different HPO methods show varying performance and robustness.
Hyperparameters have differing impacts on optimization outcomes.
Abstract
Due to the high computational demands executing a rigorous comparison between hyperparameter optimization (HPO) methods is often cumbersome. The goal of this paper is to facilitate a better empirical evaluation of HPO methods by providing benchmarks that are cheap to evaluate, but still represent realistic use cases. We believe these benchmarks provide an easy and efficient way to conduct reproducible experiments for neural hyperparameter search. Our benchmarks consist of a large grid of configurations of a feed forward neural network on four different regression datasets including architectural hyperparameters and hyperparameters concerning the training pipeline. Based on this data, we performed an in-depth analysis to gain a better understanding of the properties of the optimization problem, as well as of the importance of different types of hyperparameters. Second, we exhaustively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms · Metaheuristic Optimization Algorithms Research
MethodsHyper-parameter optimization
