No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity   Constraints

Arpan Losalka; Jonathan Scarlett

arXiv:2406.03264·stat.ML·June 6, 2024

No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints

Arpan Losalka, Jonathan Scarlett

PDF

Open Access

TL;DR

This paper introduces a new algorithm for safe Bayesian optimization that achieves low cumulative regret when the safety function is monotone in a specific variable, addressing a key challenge in safe exploration.

Contribution

The paper proposes a novel algorithm that attains sublinear regret in safe Bayesian optimization under monotonicity constraints on the safety function, expanding the theoretical understanding of safe exploration.

Findings

01

Achieves sublinear regret with monotone safety functions

02

Supports finding near-optimal actions for each context

03

Empirical results validate theoretical claims

Abstract

We consider the problem of sequentially maximizing an unknown function $f$ over a set of actions of the form $(s, x)$ , where the selected actions must satisfy a safety constraint with respect to an unknown safety function $g$ . We model $f$ and $g$ as lying in a reproducing kernel Hilbert space (RKHS), which facilitates the use of Gaussian process methods. While existing works for this setting have provided algorithms that are guaranteed to identify a near-optimal safe action, the problem of attaining low cumulative regret has remained largely unexplored, with a key challenge being that expanding the safe region can incur high regret. To address this challenge, we show that if $g$ is monotone with respect to just the single variable $s$ (with no such constraint on $f$ ), sublinear regret becomes achievable with our proposed algorithm. In addition, we show that a modified version…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Machine Learning and Algorithms · Advanced Statistical Process Monitoring

MethodsSparse Evolutionary Training · Gaussian Process