Reshuffling Resampling Splits Can Improve Generalization of   Hyperparameter Optimization

Thomas Nagler; Lennart Schneider; Bernd Bischl; Matthias Feurer

arXiv:2405.15393·stat.ML·November 11, 2024

Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter Optimization

Thomas Nagler, Lennart Schneider, Bernd Bischl, Matthias Feurer

PDF

Open Access 2 Repos 1 Video

TL;DR

Reshuffling data splits during hyperparameter optimization can significantly enhance the model's generalization performance, especially in holdout protocols, by reducing bias and variance in the validation process.

Contribution

This paper introduces the idea that reshuffling resampling splits during hyperparameter tuning improves generalization, supported by theoretical analysis and extensive experiments.

Findings

01

Reshuffling splits often outperforms fixed splits in generalization.

02

Reshuffling improves results for holdout protocols, making them competitive with cross-validation.

03

Theoretical bounds relate reshuffling benefits to data signal and noise characteristics.

Abstract

Hyperparameter optimization is crucial for obtaining peak performance of machine learning models. The standard protocol evaluates various hyperparameter configurations using a resampling estimate of the generalization error to guide optimization and select a final hyperparameter configuration. Without much evidence, paired resampling splits, i.e., either a fixed train-validation split or a fixed cross-validation scheme, are often recommended. We show that, surprisingly, reshuffling the splits for every configuration often improves the final model's generalization performance on unseen data. Our theoretical analysis explains how reshuffling affects the asymptotic behavior of the validation loss surface and provides a bound on the expected regret in the limiting regime. This bound connects the potential benefits of reshuffling to the signal and noise characteristics of the underlying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter Optimization· slideslive

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Machine Learning and Data Classification · Metaheuristic Optimization Algorithms Research