Bayesian Optimization with Gradients

Jian Wu; Matthias Poloczek; Andrew Gordon Wilson; and Peter I. Frazier

arXiv:1703.04389·stat.ML·February 8, 2018·96 cites

Bayesian Optimization with Gradients

Jian Wu, Matthias Poloczek, Andrew Gordon Wilson, and Peter I. Frazier

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new Bayesian optimization algorithm, dKG, that effectively utilizes derivative information to reduce the number of function evaluations needed for optimizing complex, expensive functions.

Contribution

The paper develops the derivative-enabled knowledge-gradient (dKG) algorithm, which exploits derivatives in Bayesian optimization, offering theoretical guarantees and improved performance over derivative-free methods.

Findings

01

dKG achieves state-of-the-art results on various benchmarks.

02

The method handles noisy and incomplete derivative data.

03

It reduces the number of function evaluations needed for optimization.

Abstract

Bayesian optimization has been successful at global optimization of expensive-to-evaluate multimodal objective functions. However, unlike most optimization methods, Bayesian optimization typically does not use derivative information. In this paper we show how Bayesian optimization can exploit derivative information to decrease the number of objective function evaluations required for good performance. In particular, we develop a novel Bayesian optimization algorithm, the derivative-enabled knowledge-gradient (dKG), for which we show one-step Bayes-optimality, asymptotic consistency, and greater one-step value of information than is possible in the derivative-free setting. Our procedure accommodates noisy and incomplete derivative information, comes in both sequential and batch forms, and can optionally reduce the computational cost of inference through automatically selected retention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wujian16/Cornell-MOE
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Algorithms