Learning to be Safe: Deep RL with a Safety Critic
Krishnan Srinivasan, Benjamin Eysenbach, Sehoon Ha, Jie Tan, Chelsea, Finn

TL;DR
This paper introduces a safety critic for deep reinforcement learning that learns safety constraints from prior tasks, enabling safer and faster learning in new environments and tasks, with fewer safety incidents.
Contribution
The paper proposes a safety critic that learns safety constraints across tasks, facilitating transfer learning for safer and more efficient deep RL.
Findings
Reduces safety incidents during learning
Enables faster convergence in new tasks
Improves stability of the learning process
Abstract
Safety is an essential component for deploying reinforcement learning (RL) algorithms in real-world scenarios, and is critical during the learning process itself. A natural first approach toward safe RL is to manually specify constraints on the policy's behavior. However, just as learning has enabled progress in large-scale development of AI systems, learning safety specifications may also be necessary to ensure safety in messy open-world environments where manual safety specifications cannot scale. Akin to how humans learn incrementally starting in child-safe environments, we propose to learn how to be safe in one set of tasks and environments, and then use that learned intuition to constrain future behaviors when learning new, modified tasks. We empirically study this form of safety-constrained transfer learning in three challenging domains: simulated navigation, quadruped locomotion,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
