Scenario Approach for Robust Blackbox Optimization in the Bandit Setting

Shaunak D. Bopardikar; Vaibhav Srivastava

arXiv:1804.10932·math.OC·May 1, 2018

Scenario Approach for Robust Blackbox Optimization in the Bandit Setting

Shaunak D. Bopardikar, Vaibhav Srivastava

PDF

Open Access

TL;DR

This paper introduces a scenario-based approach for robust blackbox optimization in bandit settings, leveraging Gaussian Process models and UCB strategies to bound and analyze regret under uncertain parameters.

Contribution

It presents a novel iterative algorithm using scenario sampling and UCB for robust blackbox optimization, with theoretical regret bounds and asymptotic analysis.

Findings

01

High probability upper bounds on regret under re-draw.

02

Conditions for regret to asymptotically tend to zero.

03

Numerical results demonstrating algorithm performance.

Abstract

This paper discusses a scenario approach to robust optimization of a blackbox function in a bandit setting. We assume that the blackbox function can be modeled as a Gaussian Process (GP) for every realization of the uncertain parameter. We adopt a scenario approach in which we draw fixed independent samples of the uncertain parameter. For a given policy, i.e., a sequence of query points and uncertain parameters in the sampled set, we introduce a notion of regret defined with respect to additional draws of the uncertain parameter, termed as scenario regret under re-draw. We present a scenario-based iterative algorithm using the upper confidence bound (UCB) of the fixed independent scenarios to compute a policy for the blackbox optimization. For this algorithm, we characterize a high probability upper bound on the regret under re-draw for any finite number of iterations of the algorithm.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management · Risk and Portfolio Optimization