Learning-Based Shielding for Safe Autonomy under Unknown Dynamics

Robert Reed; Morteza Lahijanian

arXiv:2410.07359·eess.SY·October 11, 2024

Learning-Based Shielding for Safe Autonomy under Unknown Dynamics

Robert Reed, Morteza Lahijanian

PDF

Open Access

TL;DR

This paper introduces a data-driven shielding approach for safe autonomous control of unknown systems using deep kernel learning and finite-state abstractions, enabling safety guarantees without known models.

Contribution

It presents a novel methodology combining deep kernel learning with Interval MDPs to provide safety assurances for unknown, continuous-state systems under black-box controllers.

Findings

01

Guarantees safety for unknown systems with high confidence.

02

Demonstrates effectiveness on nonlinear and high-dimensional systems.

03

Provides theoretical proofs of soundness and complexity.

Abstract

Shielding is a common method used to guarantee the safety of a system under a black-box controller, such as a neural network controller from deep reinforcement learning (DRL), with simpler, verified controllers. Existing shielding methods rely on formal verification through Markov Decision Processes (MDPs), assuming either known or finite-state models, which limits their applicability to DRL settings with unknown, continuous-state systems. This paper addresses these limitations by proposing a data-driven shielding methodology that guarantees safety for unknown systems under black-box controllers. The approach leverages Deep Kernel Learning to model the systems' one-step evolution with uncertainty quantification and constructs a finite-state abstraction as an Interval MDP (IMDP). By focusing on safety properties expressed in safe linear temporal logic (safe LTL), we develop an algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTarget Tracking and Data Fusion in Sensor Networks · Advanced Optical Sensing Technologies · Fault Detection and Control Systems

MethodsSparse Evolutionary Training