Safe and Efficient Reinforcement Learning Using   Disturbance-Observer-Based Control Barrier Functions

Yikun Cheng; Pan Zhao; Naira Hovakimyan

arXiv:2211.17250·cs.RO·August 30, 2023·5 cites

Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions

Yikun Cheng, Pan Zhao, Naira Hovakimyan

PDF

Open Access

TL;DR

This paper introduces a novel safe reinforcement learning approach that combines disturbance observers with control barrier functions, enabling efficient exploration and safety guarantees without the need for model learning.

Contribution

The proposed method leverages disturbance observers to accurately estimate uncertainties and incorporate them into CBFs, improving safety and efficiency in model-free RL.

Findings

01

Outperforms state-of-the-art safe RL algorithms in safety violation rate

02

Reduces sample and computational requirements

03

Ensures safety throughout the learning process

Abstract

Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Ecosystem dynamics and resilience