Standard Gaussian Process is All You Need for High-Dimensional Bayesian Optimization
Zhitong Xu, Haitao Wang, Jeff M Phillips, Shandian Zhe

TL;DR
This paper challenges the belief that standard Gaussian process-based Bayesian Optimization underperforms in high dimensions, showing that with proper kernel choice and initialization, it can be highly effective.
Contribution
The study provides empirical evidence, theoretical analysis, and a simple initialization method that significantly improve high-dimensional Bayesian Optimization using standard GPs.
Findings
Matérn kernels outperform SE kernels in high dimensions.
Proper initialization mitigates gradient vanishing issues.
Standard BO can be competitive with specialized high-dimensional methods.
Abstract
A long-standing belief holds that Bayesian Optimization (BO) with standard Gaussian processes (GP) -- referred to as standard BO -- underperforms in high-dimensional optimization problems. While this belief seems plausible, it lacks both robust empirical evidence and theoretical justification. To address this gap, we present a systematic investigation. First, through a comprehensive evaluation across twelve benchmarks, we found that while the popular Square Exponential (SE) kernel often leads to poor performance, using Mat\'ern kernels enables standard BO to consistently achieve top-tier results, frequently surpassing methods specifically designed for high-dimensional optimization. Second, our theoretical analysis reveals that the SE kernel's failure primarily stems from improper initialization of the length-scale parameters, which are commonly used in practice but can cause gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference
MethodsGaussian Process · Greedy Policy Search
