Exploiting gradients and Hessians in Bayesian optimization and Bayesian quadrature
Anqi Wu, Mikio C. Aoi, Jonathan W. Pillow

TL;DR
This paper extends Bayesian optimization and quadrature methods to incorporate derivative information, demonstrating that using gradients and Hessians significantly improves efficiency and accuracy in function learning, optimization, and integration tasks.
Contribution
It introduces novel techniques for leveraging derivatives in Gaussian process models for Bayesian optimization and quadrature, addressing previous ill-conditioning issues and demonstrating substantial performance gains.
Findings
Derivative information accelerates hyperparameter convergence.
Gradient-enhanced methods outperform standard Gaussian processes.
Applications show improved optimization and integration results.
Abstract
An exciting branch of machine learning research focuses on methods for learning, optimizing, and integrating unknown functions that are difficult or costly to evaluate. A popular Bayesian approach to this problem uses a Gaussian process (GP) to construct a posterior distribution over the function of interest given a set of observed measurements, and selects new points to evaluate using the statistics of this posterior. Here we extend these methods to exploit derivative information from the unknown function. We describe methods for Bayesian optimization (BO) and Bayesian quadrature (BQ) in settings where first and second derivatives may be evaluated along with the function itself. We perform sampling-based inference in order to incorporate uncertainty over hyperparameters, and show that both hyperparameter and function uncertainty decrease much more rapidly when using derivative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Algorithms
