Probabilistic Reach-Avoid for Bayesian Neural Networks

Matthew Wicker; Luca Laurenti; Andrea Patane; Nicola Paoletti,; Alessandro Abate; Marta Kwiatkowska

arXiv:2310.01951·cs.LG·October 4, 2023

Probabilistic Reach-Avoid for Bayesian Neural Networks

Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti,, Alessandro Abate, Marta Kwiatkowska

PDF

Open Access 1 Repo

TL;DR

This paper presents a method to compute safety guarantees and synthesize optimal control policies for Bayesian neural network models in stochastic environments, enhancing safety certification in model-based reinforcement learning.

Contribution

It introduces a novel approach combining interval propagation and backward recursion to compute lower bounds on reach-avoid probabilities and synthesizes policies maximizing these bounds.

Findings

01

Over four-fold increase in certifiable states on benchmarks.

02

More than three-fold increase in guaranteed reach-avoid probability.

03

Effective safety certification for BNN-based control policies.

Abstract

Model-based reinforcement learning seeks to simultaneously learn the dynamics of an unknown stochastic environment and synthesise an optimal policy for acting in it. Ensuring the safety and robustness of sequential decisions made through a policy in such an environment is a key challenge for policies intended for safety-critical scenarios. In this work, we investigate two complementary problems: first, computing reach-avoid probabilities for iterative predictions made with dynamical models, with dynamics described by Bayesian neural network (BNN); second, synthesising control policies that are optimal with respect to a given reach-avoid specification (reaching a "target" state, while avoiding a set of "unsafe" states) and a learned BNN model. Our solution leverages interval propagation and backward recursion techniques to compute lower bounds for the probability that a policy's sequence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

matthewwicker/bnnreachavoid
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Algorithms