What do you Mean? The Role of the Mean Function in Bayesian Optimisation
George De Ath, Jonathan E. Fieldsend, Richard M. Everson

TL;DR
This paper investigates how the choice of mean function in Gaussian process models affects the convergence of Bayesian optimisation, finding that simple worst-case mean functions often perform best in synthetic problems.
Contribution
It provides an empirical analysis of 8 different mean functions in Bayesian optimisation, highlighting the impact on convergence across synthetic and real-world problems.
Findings
Constant mean equal to the worst observed value improves convergence in synthetic problems.
More complex mean functions may be beneficial for real-world tasks.
Choice of mean function significantly influences optimisation performance.
Abstract
Bayesian optimisation is a popular approach for optimising expensive black-box functions. The next location to be evaluated is selected via maximising an acquisition function that balances exploitation and exploration. Gaussian processes, the surrogate models of choice in Bayesian optimisation, are often used with a constant prior mean function equal to the arithmetic mean of the observed function values. We show that the rate of convergence can depend sensitively on the choice of mean function. We empirically investigate 8 mean functions (constant functions equal to the arithmetic mean, minimum, median and maximum of the observed function evaluations, linear, quadratic polynomials, random forests and RBF networks), using 10 synthetic test problems and two real-world problems, and using the Expected Improvement and Upper Confidence Bound acquisition functions. We find that for design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research
