Cautious Adaptation For Reinforcement Learning in Safety-Critical   Settings

Jesse Zhang; Brian Cheung; Chelsea Finn; Sergey Levine; Dinesh; Jayaraman

arXiv:2008.06622·cs.LG·January 19, 2021·26 cites

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh, Jayaraman

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CARL, a risk-averse reinforcement learning method that trains in diverse source environments to enable cautious adaptation in safety-critical target settings, improving safety and performance.

Contribution

The paper proposes a novel risk-aware adaptation framework, CARL, that leverages model-based RL to estimate uncertainties and guide cautious exploration in safety-critical environments.

Findings

01

CARL achieves higher rewards than baselines in safety-critical tasks.

02

CARL reduces failure rates during adaptation.

03

It effectively balances exploration and safety in diverse environments.

Abstract

Reinforcement learning (RL) in real-world safety-critical target settings like urban driving is hazardous, imperiling the RL agent, other agents, and the environment. To overcome this difficulty, we propose a "safety-critical adaptation" task setting: an agent first trains in non-safety-critical "source" environments such as in a simulator, before it adapts to the target environment where failures carry heavy costs. We propose a solution approach, CARL, that builds on the intuition that prior experience in diverse environments equips an agent to estimate risk, which in turn enables relative safety through risk-averse, cautious adaptation. CARL first employs model-based RL to train a probabilistic model to capture uncertainty about transition dynamics and catastrophic states across varied source environments. Then, when exploring a new safety-critical environment with unknown dynamics,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jesbu1/carl
pytorch

Videos

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings· slideslive

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety