End-to-End Safe Reinforcement Learning through Barrier Functions for   Safety-Critical Continuous Control Tasks

Richard Cheng; Gabor Orosz; Richard M. Murray; Joel W. Burdick

arXiv:1903.08792·cs.LG·March 22, 2019·43 cites

End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

Richard Cheng, Gabor Orosz, Richard M. Murray, Joel W. Burdick

PDF

Open Access 1 Repo

TL;DR

This paper introduces a safe reinforcement learning framework that combines model-free RL, control barrier functions, and online system modeling to ensure safety and improve learning efficiency in continuous control tasks.

Contribution

The paper presents RL-CBF, a novel controller synthesis algorithm that guarantees safety with high probability during learning, integrating model-free RL with model-based safety constraints.

Findings

01

RL-CBF guarantees safety during learning.

02

The approach improves sample efficiency over existing methods.

03

The method successfully controls an inverted pendulum and autonomous vehicle scenarios.

Abstract

Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process. Real world systems would realistically fail or break before an optimal controller can be learned. To address this issue, we propose a controller architecture that combines (1) a model-free RL-based controller with (2) model-based controllers utilizing control barrier functions (CBFs) and (3) on-line learning of the unknown system dynamics, in order to ensure safety during learning. Our general framework leverages the success of RL algorithms to learn high-performance controllers, while the CBF-based controllers both guarantee safety and guide the learning process by constraining the set of explorable polices. We utilize Gaussian Processes (GPs) to model the system dynamics and its uncertainties. Our novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rcheng805/RL-CBF
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference