Learning to be Safe: Deep RL with a Safety Critic

Krishnan Srinivasan; Benjamin Eysenbach; Sehoon Ha; Jie Tan; Chelsea; Finn

arXiv:2010.14603·cs.LG·October 29, 2020·26 cites

Learning to be Safe: Deep RL with a Safety Critic

Krishnan Srinivasan, Benjamin Eysenbach, Sehoon Ha, Jie Tan, Chelsea, Finn

PDF

Open Access

TL;DR

This paper introduces a safety critic for deep reinforcement learning that learns safety constraints from prior tasks, enabling safer and faster learning in new environments and tasks, with fewer safety incidents.

Contribution

The paper proposes a safety critic that learns safety constraints across tasks, facilitating transfer learning for safer and more efficient deep RL.

Findings

01

Reduces safety incidents during learning

02

Enables faster convergence in new tasks

03

Improves stability of the learning process

Abstract

Safety is an essential component for deploying reinforcement learning (RL) algorithms in real-world scenarios, and is critical during the learning process itself. A natural first approach toward safe RL is to manually specify constraints on the policy's behavior. However, just as learning has enabled progress in large-scale development of AI systems, learning safety specifications may also be necessary to ensure safety in messy open-world environments where manual safety specifications cannot scale. Akin to how humans learn incrementally starting in child-safe environments, we propose to learn how to be safe in one set of tasks and environments, and then use that learned intuition to constrain future behaviors when learning new, modified tasks. We empirically study this form of safety-constrained transfer learning in three challenging domains: simulated navigation, quadruped locomotion,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning