Default Machine Learning Hyperparameters Do Not Provide Informative Initialization for Bayesian Optimization

Nicol\'as Villagr\'an Prieto; Eduardo C. Garrido-Merch\'an

arXiv:2602.08774·cs.LG·February 10, 2026

Default Machine Learning Hyperparameters Do Not Provide Informative Initialization for Bayesian Optimization

Nicol\'as Villagr\'an Prieto, Eduardo C. Garrido-Merch\'an

PDF

Open Access

TL;DR

This study rigorously tests whether using default hyperparameters as initial points in Bayesian Optimization improves tuning efficiency, finding no significant advantage over random initialization across multiple models and datasets.

Contribution

It provides empirical evidence that default hyperparameters do not offer informative starting points for Bayesian Optimization, challenging common assumptions.

Findings

01

Default-informed initialization shows no significant performance gain.

02

Tighter priors around defaults only benefit early evaluations, not final results.

03

Practitioners should prioritize data-driven tuning over default hyperparameter reliance.

Abstract

Bayesian Optimization (BO) is a standard tool for hyperparameter tuning thanks to its sample efficiency on expensive black-box functions. While most BO pipelines begin with uniform random initialization, default hyperparameter values shipped with popular ML libraries such as scikit-learn encode implicit expert knowledge and could serve as informative starting points that accelerate convergence. This hypothesis, despite its intuitive appeal, has remained largely unexamined. We formalize the idea by initializing BO with points drawn from truncated Gaussian distributions centered at library defaults and compare the resulting trajectories against a uniform-random baseline. We conduct an extensive empirical evaluation spanning three BO back-ends (BoTorch, Optuna, Scikit-Optimize), three model families (Random Forests, Support Vector Machines, Multilayer Perceptrons), and five benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference