Scalable First-Order Bayesian Optimization via Structured Automatic Differentiation
Sebastian Ament, Carla Gomes

TL;DR
This paper introduces a scalable approach to first-order Bayesian Optimization that leverages structured automatic differentiation to efficiently incorporate gradient and Hessian information, enabling high-dimensional optimization.
Contribution
It develops a structure-aware automatic differentiation method that exploits kernel matrix structures for scalable gradient and Hessian computations in Bayesian Optimization.
Findings
Achieves $ ext{O}(n^2d)$ matrix-vector multiply for gradient observations.
Enables automatic extension to complex kernels like neural networks and spectral mixtures.
Scales first-order Bayesian Optimization to high-dimensional problems.
Abstract
Bayesian Optimization (BO) has shown great promise for the global optimization of functions that are expensive to evaluate, but despite many successes, standard approaches can struggle in high dimensions. To improve the performance of BO, prior work suggested incorporating gradient information into a Gaussian process surrogate of the objective, giving rise to kernel matrices of size for observations in dimensions. Na\"ively multiplying with (resp. inverting) these matrices requires (resp. )) operations, which becomes infeasible for moderate dimensions and sample sizes. Here, we observe that a wide range of kernels gives rise to structured matrices, enabling an exact matrix-vector multiply for gradient observations and for Hessian observations. Beyond canonical kernel classes, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Algorithms · Advanced Bandit Algorithms Research
MethodsGaussian Process
