Direct Regret Optimization in Bayesian Optimization
Fengxue Zhang, Yuxin Chen

TL;DR
This paper introduces a direct regret optimization method for Bayesian optimization that uses simulated trajectories and a decision transformer to improve multi-step decision making, outperforming traditional BO approaches.
Contribution
It presents a novel framework combining ensemble GPs, simulated trajectories, and a decision transformer to directly optimize multi-step regret in Bayesian optimization.
Findings
Outperforms baseline BO methods in synthetic and real-world benchmarks.
Achieves lower simple regret and more robust exploration.
Effective in high-dimensional and noisy settings.
Abstract
Bayesian optimization (BO) is a powerful paradigm for optimizing expensive black-box functions. Traditional BO methods typically rely on separate hand-crafted acquisition functions and surrogate models for the underlying function, and often operate in a myopic manner. In this paper, we propose a novel direct regret optimization approach that jointly learns the optimal model and non-myopic acquisition by distilling from a set of candidate models and acquisitions, and explicitly targets minimizing the multi-step regret. Our framework leverages an ensemble of Gaussian Processes (GPs) with varying hyperparameters to generate simulated BO trajectories, each guided by an acquisition function chosen from a pool of conventional choices, until a Bayesian early stop criterion is met. These simulated trajectories, capturing multi-step exploration strategies, are used to train an end-to-end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification
