Multi-Agent LLMs for Adaptive Acquisition in Bayesian Optimization
Andrea Carbonati, Mohammadsina Almasi, Hadis Anahideh

TL;DR
This paper investigates how Large Language Models manage exploration and exploitation in Bayesian optimization, proposing a multi-agent framework that improves search effectiveness by separating strategic control from candidate generation.
Contribution
It introduces a multi-agent approach that decomposes exploration-exploitation control into strategic and tactical components, enhancing LLM-based optimization performance.
Findings
Single-agent LLM approaches suffer from cognitive overload and unstable search dynamics.
Decomposing control into strategy and generation agents improves search effectiveness.
Empirical results show significant performance gains across optimization benchmarks.
Abstract
The exploration-exploitation trade-off is central to sequential decision-making and black-box optimization, yet how Large Language Models (LLMs) reason about and manage this trade-off remains poorly understood. Unlike Bayesian Optimization, where exploration and exploitation are explicitly encoded through acquisition functions, LLM-based optimization relies on implicit, prompt-based reasoning over historical evaluations, making search behavior difficult to analyze or control. In this work, we present a metric-level study of LLM-mediated search policy learning, studying how LLMs construct and adapt exploration-exploitation strategies under multiple operational definitions of exploration, including informativeness, diversity, and representativeness. We show that single-agent LLM approaches, which jointly perform strategy selection and candidate generation within a single prompt, suffer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
