Deep Probabilistic Accelerated Evaluation: A Robust Certifiable Rare-Event Simulation Methodology for Black-Box Safety-Critical Systems
Mansur Arief, Zhiyuan Huang, Guru Koushik Senthil Kumar, Yuanlu Bai,, Shengyi He, Wenhao Ding, Henry Lam, Ding Zhao

TL;DR
This paper introduces Deep-PrAE, a novel framework that enhances importance sampling for black-box safety-critical systems, providing statistically guaranteed bounds on rare-event probabilities through deep learning and theoretical analysis.
Contribution
Deep-PrAE is the first method to convert black-box samplers into statistically guaranteed importance samplers with relaxed efficiency certificates using deep neural networks.
Findings
Effective in safety-testing of intelligent driving algorithms
Provides bounds on rare-event probabilities with theoretical guarantees
Demonstrates improved efficiency over traditional methods
Abstract
Evaluating the reliability of intelligent physical systems against rare safety-critical events poses a huge testing burden for real-world applications. Simulation provides a useful platform to evaluate the extremal risks of these systems before their deployments. Importance Sampling (IS), while proven to be powerful for rare-event simulation, faces challenges in handling these learning-based systems due to their black-box nature that fundamentally undermines its efficiency guarantee, which can lead to under-estimation without diagnostically detected. We propose a framework called Deep Probabilistic Accelerated Evaluation (Deep-PrAE) to design statistically guaranteed IS, by converting black-box samplers that are versatile but could lack guarantees, into one with what we call a relaxed efficiency certificate that allows accurate estimation of bounds on the safety-critical event…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Statistical Distribution Estimation and Applications · Probability and Risk Models
