No-Regret Algorithms for Safe Bayesian Optimization with Monotonicity Constraints
Arpan Losalka, Jonathan Scarlett

TL;DR
This paper introduces a new algorithm for safe Bayesian optimization that achieves low cumulative regret when the safety function is monotone in a specific variable, addressing a key challenge in safe exploration.
Contribution
The paper proposes a novel algorithm that attains sublinear regret in safe Bayesian optimization under monotonicity constraints on the safety function, expanding the theoretical understanding of safe exploration.
Findings
Achieves sublinear regret with monotone safety functions
Supports finding near-optimal actions for each context
Empirical results validate theoretical claims
Abstract
We consider the problem of sequentially maximizing an unknown function over a set of actions of the form , where the selected actions must satisfy a safety constraint with respect to an unknown safety function . We model and as lying in a reproducing kernel Hilbert space (RKHS), which facilitates the use of Gaussian process methods. While existing works for this setting have provided algorithms that are guaranteed to identify a near-optimal safe action, the problem of attaining low cumulative regret has remained largely unexplored, with a key challenge being that expanding the safe region can incur high regret. To address this challenge, we show that if is monotone with respect to just the single variable (with no such constraint on ), sublinear regret becomes achievable with our proposed algorithm. In addition, we show that a modified version…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Machine Learning and Algorithms · Advanced Statistical Process Monitoring
MethodsSparse Evolutionary Training · Gaussian Process
