BOSS: Bayesian Optimization over String Spaces
Henry B. Moss, Daniel Beck, Javier Gonzalez, David S. Leslie, Paul, Rayson

TL;DR
This paper introduces BOSS, a Bayesian optimization method that directly optimizes over raw strings using string kernels and genetic algorithms, avoiding complex latent space mappings and improving efficiency and effectiveness.
Contribution
It is the first to integrate string kernels and genetic algorithms within Bayesian optimization loops for direct string space optimization.
Findings
Significantly outperforms existing string optimization methods.
Supports variable length inputs and syntactical constraints.
Effective in grammar-governed syntax scenarios.
Abstract
This article develops a Bayesian optimization (BO) method which acts directly over raw strings, proposing the first uses of string kernels and genetic algorithms within BO loops. Recent applications of BO over strings have been hindered by the need to map inputs into a smooth and unconstrained latent space. Learning this projection is computationally and data-intensive. Our approach instead builds a powerful Gaussian process surrogate model based on string kernels, naturally supporting variable length inputs, and performs efficient acquisition function maximization for spaces with syntactical constraints. Experiments demonstrate considerably improved optimization over existing approaches across a broad range of constraints, including the popular setting where syntax is governed by a context-free grammar.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
MethodsGaussian Process
