Verifiably Safe Exploration for End-to-End Reinforcement Learning

Nathan Hunt; Nathan Fulton; Sara Magliacane; Nghia Hoang; Subhro Das,; Armando Solar-Lezama

arXiv:2007.01223·cs.AI·July 3, 2020

Verifiably Safe Exploration for End-to-End Reinforcement Learning

Nathan Hunt, Nathan Fulton, Sara Magliacane, Nghia Hoang, Subhro Das,, Armando Solar-Lezama

PDF

1 Repo

TL;DR

This paper introduces a novel method for ensuring safety constraints in end-to-end reinforcement learning with visual inputs, enabling safe exploration in safety-critical applications.

Contribution

It presents the first approach to enforce formal safety constraints on visual policy learning, combining object detection and automated reasoning.

Findings

01

Algorithm avoids unsafe behaviors in all benchmark problems.

02

Method preserves all safe policies from the original environment.

03

Approach remains competitive in reward optimization.

Abstract

Deploying deep reinforcement learning in safety-critical settings requires developing algorithms that obey hard constraints during exploration. This paper contributes a first approach toward enforcing formal safety constraints on end-to-end policies with visual inputs. Our approach draws on recent advances in object detection and automated reasoning for hybrid dynamical systems. The approach is evaluated on a novel benchmark that emphasizes the challenge of safely exploring in the presence of hard constraints. Our benchmark draws from several proposed problem sets for safe learning and includes problems that emphasize challenges such as reward signals that are not aligned with safety constraints. On each of these benchmark problems, our algorithm completely avoids unsafe behavior while remaining competitive at optimizing for as much reward as is safe. We also prove that our method of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/vsrl-framework
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.