Recursive Two-Step Lookahead Expected Payoff for Time-Dependent Bayesian   Optimization

S. Ashwin Renganathan; Jeffrey Larson; Stefan Wild

arXiv:2006.08037·math.OC·December 9, 2020·1 cites

Recursive Two-Step Lookahead Expected Payoff for Time-Dependent Bayesian Optimization

S. Ashwin Renganathan, Jeffrey Larson, Stefan Wild

PDF

Open Access

TL;DR

This paper introduces a recursive two-step lookahead Bayesian optimization method, $ exttt{r2LEY}$, designed for efficient decision-making in time-dependent, expensive-to-evaluate scenarios with a finite horizon.

Contribution

It presents a novel recursive two-step lookahead acquisition function that efficiently approximates multistep planning in time-dependent Bayesian optimization.

Findings

01

$ exttt{r2LEY}$ outperforms myopic methods in time-dependent settings.

02

It effectively balances exploration and exploitation near the horizon.

03

Demonstrated superior performance on synthetic and real-world datasets.

Abstract

We propose a novel Bayesian method to solve the maximization of a time-dependent expensive-to-evaluate oracle. We are interested in the decision that maximizes the oracle at a finite time horizon, when relatively few noisy evaluations can be performed before the horizon. Our recursive, two-step lookahead expected payoff ( $r2LEY$ ) acquisition function makes nonmyopic decisions at every stage by maximizing the estimated expected value of the oracle at the horizon. $r2LEY$ circumvents the evaluation of the expensive multistep (more than two steps) lookahead acquisition function by recursively optimizing a two-step lookahead acquisition function at every stage; unbiased estimators of this latter function and its gradient are utilized for efficient optimization. $r2LEY$ is shown to exhibit natural exploration properties far from the time horizon, enabling accurate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Machine Learning and Algorithms