Language Model Embeddings Can Be Sufficient for Bayesian Optimization

Tung Nguyen; Qiuyi Zhang; Bangding Yang; Chansoo Lee; Jorg Bornschein; Yingjie Miao; Sagi Perel; Yutian Chen; Xingyou Song

arXiv:2410.10190·cs.LG·October 10, 2025

Language Model Embeddings Can Be Sufficient for Bayesian Optimization

Tung Nguyen, Qiuyi Zhang, Bangding Yang, Chansoo Lee, Jorg Bornschein, Yingjie Miao, Sagi Perel, Yutian Chen, Xingyou Song

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that language model embeddings of string inputs can serve as effective, flexible regressors in Bayesian Optimization, matching traditional methods in diverse optimization tasks.

Contribution

It introduces a novel approach using LLM embeddings for in-context regression in Bayesian Optimization, enabling general-purpose, domain-agnostic optimization.

Findings

01

Comparable optimization performance to Gaussian Process methods

02

Effective across synthetic, combinatorial, and hyperparameter domains

03

Shows potential for broader application and flexibility

Abstract

Bayesian Optimization is ubiquitous in experimental design and black-box optimization for improving search efficiency. However, most existing approaches rely on regression models which are limited to fixed search spaces and structured, tabular input features. This paper explores the use of LLM embeddings over string inputs for in-context regression in Bayesian Optimization. Our results show that representing inputs as strings enables general-purpose regression across diverse domains, including synthetic, combinatorial, and hyperparameter optimization. Furthermore, our approach achieves optimization performance comparable to state-of-the-art Gaussian Process-based methods such as Google Vizier, and demonstrates potential for broader and more flexible applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/optformer
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques