Recursive Two-Step Lookahead Expected Payoff for Time-Dependent Bayesian Optimization
S. Ashwin Renganathan, Jeffrey Larson, Stefan Wild

TL;DR
This paper introduces a recursive two-step lookahead Bayesian optimization method, $ exttt{r2LEY}$, designed for efficient decision-making in time-dependent, expensive-to-evaluate scenarios with a finite horizon.
Contribution
It presents a novel recursive two-step lookahead acquisition function that efficiently approximates multistep planning in time-dependent Bayesian optimization.
Findings
$ exttt{r2LEY}$ outperforms myopic methods in time-dependent settings.
It effectively balances exploration and exploitation near the horizon.
Demonstrated superior performance on synthetic and real-world datasets.
Abstract
We propose a novel Bayesian method to solve the maximization of a time-dependent expensive-to-evaluate oracle. We are interested in the decision that maximizes the oracle at a finite time horizon, when relatively few noisy evaluations can be performed before the horizon. Our recursive, two-step lookahead expected payoff () acquisition function makes nonmyopic decisions at every stage by maximizing the estimated expected value of the oracle at the horizon. circumvents the evaluation of the expensive multistep (more than two steps) lookahead acquisition function by recursively optimizing a two-step lookahead acquisition function at every stage; unbiased estimators of this latter function and its gradient are utilized for efficient optimization. is shown to exhibit natural exploration properties far from the time horizon, enabling accurate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms
