Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter Optimization
Thomas Nagler, Lennart Schneider, Bernd Bischl, Matthias Feurer

TL;DR
Reshuffling data splits during hyperparameter optimization can significantly enhance the model's generalization performance, especially in holdout protocols, by reducing bias and variance in the validation process.
Contribution
This paper introduces the idea that reshuffling resampling splits during hyperparameter tuning improves generalization, supported by theoretical analysis and extensive experiments.
Findings
Reshuffling splits often outperforms fixed splits in generalization.
Reshuffling improves results for holdout protocols, making them competitive with cross-validation.
Theoretical bounds relate reshuffling benefits to data signal and noise characteristics.
Abstract
Hyperparameter optimization is crucial for obtaining peak performance of machine learning models. The standard protocol evaluates various hyperparameter configurations using a resampling estimate of the generalization error to guide optimization and select a final hyperparameter configuration. Without much evidence, paired resampling splits, i.e., either a fixed train-validation split or a fixed cross-validation scheme, are often recommended. We show that, surprisingly, reshuffling the splits for every configuration often improves the final model's generalization performance on unseen data. Our theoretical analysis explains how reshuffling affects the asymptotic behavior of the validation loss surface and provides a bound on the expected regret in the limiting regime. This bound connects the potential benefits of reshuffling to the signal and noise characteristics of the underlying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Machine Learning and Data Classification · Metaheuristic Optimization Algorithms Research
