Overtuning in Hyperparameter Optimization
Lennart Schneider, Bernd Bischl, Matthias Feurer

TL;DR
This paper investigates the phenomenon of overtuning in hyperparameter optimization, revealing that excessive tuning can lead to worse generalization, and emphasizes the need for awareness and mitigation strategies in HPO practices.
Contribution
It formally defines overtuning in HPO, analyzes its prevalence through large-scale reanalysis, and discusses factors influencing it along with potential mitigation strategies.
Findings
Overtuning is more common than previously thought, often mild but sometimes severe.
In about 10% of cases, overtuning results in worse generalization than default configurations.
Factors like dataset size and resampling strategy significantly affect overtuning prevalence.
Abstract
Hyperparameter optimization (HPO) aims to identify an optimal hyperparameter configuration (HPC) such that the resulting model generalizes well to unseen data. As the expected generalization error cannot be optimized directly, it is estimated with a resampling strategy, such as holdout or cross-validation. This approach implicitly assumes that minimizing the validation error leads to improved generalization. However, since validation error estimates are inherently stochastic and depend on the resampling strategy, a natural question arises: Can excessive optimization of the validation error lead to overfitting at the HPO level, akin to overfitting in model training based on empirical risk minimization? In this paper, we investigate this phenomenon, which we term overtuning, a form of overfitting specific to HPO. Despite its practical relevance, overtuning has received limited attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms
MethodsHyper-parameter optimization
