Bayesian Optimization with Gradients
Jian Wu, Matthias Poloczek, Andrew Gordon Wilson, and Peter I. Frazier

TL;DR
This paper introduces a new Bayesian optimization algorithm, dKG, that effectively utilizes derivative information to reduce the number of function evaluations needed for optimizing complex, expensive functions.
Contribution
The paper develops the derivative-enabled knowledge-gradient (dKG) algorithm, which exploits derivatives in Bayesian optimization, offering theoretical guarantees and improved performance over derivative-free methods.
Findings
dKG achieves state-of-the-art results on various benchmarks.
The method handles noisy and incomplete derivative data.
It reduces the number of function evaluations needed for optimization.
Abstract
Bayesian optimization has been successful at global optimization of expensive-to-evaluate multimodal objective functions. However, unlike most optimization methods, Bayesian optimization typically does not use derivative information. In this paper we show how Bayesian optimization can exploit derivative information to decrease the number of objective function evaluations required for good performance. In particular, we develop a novel Bayesian optimization algorithm, the derivative-enabled knowledge-gradient (dKG), for which we show one-step Bayes-optimality, asymptotic consistency, and greater one-step value of information than is possible in the derivative-free setting. Our procedure accommodates noisy and incomplete derivative information, comes in both sequential and batch forms, and can optionally reduce the computational cost of inference through automatically selected retention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Algorithms
