Verification-Guided Shielding for Deep Reinforcement Learning
Davide Corsi, Guy Amir, Andoni Rodriguez, Cesar Sanchez, Guy Katz, Roy, Fox

TL;DR
This paper introduces verification-guided shielding, a novel method that combines formal verification and shielding to improve the safety and efficiency of deep reinforcement learning policies in safety-critical applications.
Contribution
It presents an integrated approach that reduces runtime overhead of shielding while maintaining formal safety guarantees by partitioning input space and compressing unsafe regions.
Findings
Significantly reduces runtime overhead compared to traditional shielding.
Effectively identifies safe and unsafe regions using combined verification methods.
Demonstrates scalability and safety in robotic navigation benchmarks.
Abstract
In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach to solving real-world tasks. However, despite their successes, DRL-based policies suffer from poor reliability, which limits their deployment in safety-critical domains. Various methods have been put forth to address this issue by providing formal safety guarantees. Two main approaches include shielding and verification. While shielding ensures the safe behavior of the policy by employing an external online component (i.e., a ``shield'') that overrides potentially dangerous actions, this approach has a significant computational cost as the shield must be invoked at runtime to validate every decision. On the other hand, verification is an offline process that can identify policies that are unsafe, prior to their deployment, yet, without providing alternative actions when such a policy is deemed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Memory and Neural Computing · Neuroscience and Neural Engineering
