Verification-Guided Shielding for Deep Reinforcement Learning

Davide Corsi; Guy Amir; Andoni Rodriguez; Cesar Sanchez; Guy Katz; Roy; Fox

arXiv:2406.06507·cs.LG·June 24, 2024·1 cites

Verification-Guided Shielding for Deep Reinforcement Learning

Davide Corsi, Guy Amir, Andoni Rodriguez, Cesar Sanchez, Guy Katz, Roy, Fox

PDF

Open Access

TL;DR

This paper introduces verification-guided shielding, a novel method that combines formal verification and shielding to improve the safety and efficiency of deep reinforcement learning policies in safety-critical applications.

Contribution

It presents an integrated approach that reduces runtime overhead of shielding while maintaining formal safety guarantees by partitioning input space and compressing unsafe regions.

Findings

01

Significantly reduces runtime overhead compared to traditional shielding.

02

Effectively identifies safe and unsafe regions using combined verification methods.

03

Demonstrates scalability and safety in robotic navigation benchmarks.

Abstract

In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach to solving real-world tasks. However, despite their successes, DRL-based policies suffer from poor reliability, which limits their deployment in safety-critical domains. Various methods have been put forth to address this issue by providing formal safety guarantees. Two main approaches include shielding and verification. While shielding ensures the safe behavior of the policy by employing an external online component (i.e., a ``shield'') that overrides potentially dangerous actions, this approach has a significant computational cost as the shield must be invoked at runtime to validate every decision. On the other hand, verification is an offline process that can identify policies that are unsafe, prior to their deployment, yet, without providing alternative actions when such a policy is deemed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Memory and Neural Computing · Neuroscience and Neural Engineering