Bayesian Safety Validation for Failure Probability Estimation of Black-Box Systems
Robert J. Moss, Mykel J. Kochenderfer, Maxime Gariel, Arthur Dubois

TL;DR
This paper presents a Bayesian optimization-based method for efficiently estimating failure probabilities in safety-critical black-box systems, reducing sample requirements and improving accuracy.
Contribution
It introduces a novel Bayesian safety validation framework with specialized acquisition functions for failure prediction and probability estimation in high-dimensional, expensive simulation scenarios.
Findings
Achieves more accurate failure probability estimates with fewer samples.
Performs well across multiple safety validation metrics.
Successfully applied to diverse test systems, including neural networks.
Abstract
Estimating the probability of failure is an important step in the certification of safety-critical systems. Efficient estimation methods are often needed due to the challenges posed by high-dimensional input spaces, risky test scenarios, and computationally expensive simulators. This work frames the problem of black-box safety validation as a Bayesian optimization problem and introduces a method that iteratively fits a probabilistic surrogate model to efficiently predict failures. The algorithm is designed to search for failures, compute the most-likely failure, and estimate the failure probability over an operating domain using importance sampling. We introduce three acquisition functions that aim to reduce uncertainty by covering the design space, optimize the analytically derived failure boundaries, and sample the predicted failure regions. Results show this Bayesian safety…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software Reliability and Analysis Research · Advanced Statistical Methods and Models
MethodsTest
