End-to-End Learning of Deep Kernel Acquisition Functions for Bayesian   Optimization

Tomoharu Iwata

arXiv:2111.00639·stat.ML·November 2, 2021·1 cites

End-to-End Learning of Deep Kernel Acquisition Functions for Bayesian Optimization

Tomoharu Iwata

PDF

Open Access

TL;DR

This paper introduces a meta-learning approach for Bayesian optimization that trains neural network-based kernels to directly improve optimization performance across multiple tasks, outperforming existing methods.

Contribution

It proposes a novel meta-learning framework that trains neural network kernels to minimize the gap to the true optimum, enhancing BO performance on unseen tasks.

Findings

01

Outperforms existing BO methods on text datasets

02

Effective transfer of knowledge across tasks

03

Improved optimization accuracy and efficiency

Abstract

For Bayesian optimization (BO) on high-dimensional data with complex structure, neural network-based kernels for Gaussian processes (GPs) have been used to learn flexible surrogate functions by the high representation power of deep learning. However, existing methods train neural networks by maximizing the marginal likelihood, which do not directly improve the BO performance. In this paper, we propose a meta-learning method for BO with neural network-based kernels that minimizes the expected gap between the true optimum value and the best value found by BO. We model a policy, which takes the current evaluated data points as input and outputs the next data point to be evaluated, by a neural network, where neural network-based kernels, GPs, and mutual information-based acquisition functions are used as its layers. With our model, the neural network-based kernel is trained to be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms

MethodsTest