Default Machine Learning Hyperparameters Do Not Provide Informative Initialization for Bayesian Optimization
Nicol\'as Villagr\'an Prieto, Eduardo C. Garrido-Merch\'an

TL;DR
This study rigorously tests whether using default hyperparameters as initial points in Bayesian Optimization improves tuning efficiency, finding no significant advantage over random initialization across multiple models and datasets.
Contribution
It provides empirical evidence that default hyperparameters do not offer informative starting points for Bayesian Optimization, challenging common assumptions.
Findings
Default-informed initialization shows no significant performance gain.
Tighter priors around defaults only benefit early evaluations, not final results.
Practitioners should prioritize data-driven tuning over default hyperparameter reliance.
Abstract
Bayesian Optimization (BO) is a standard tool for hyperparameter tuning thanks to its sample efficiency on expensive black-box functions. While most BO pipelines begin with uniform random initialization, default hyperparameter values shipped with popular ML libraries such as scikit-learn encode implicit expert knowledge and could serve as informative starting points that accelerate convergence. This hypothesis, despite its intuitive appeal, has remained largely unexamined. We formalize the idea by initializing BO with points drawn from truncated Gaussian distributions centered at library defaults and compare the resulting trajectories against a uniform-random baseline. We conduct an extensive empirical evaluation spanning three BO back-ends (BoTorch, Optuna, Scikit-Optimize), three model families (Random Forests, Support Vector Machines, Multilayer Perceptrons), and five benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference
