Safe Inverse Reinforcement Learning via Control Barrier Function

Yue Yang; Letian Chen; Matthew Gombolay

arXiv:2212.02753·cs.RO·March 8, 2023

Safe Inverse Reinforcement Learning via Control Barrier Function

Yue Yang, Letian Chen, Matthew Gombolay

PDF

Open Access

TL;DR

This paper introduces CBFIRL, a safe inverse reinforcement learning framework that integrates control barrier functions to improve safety during robot learning from demonstrations, demonstrated through improved safety metrics in simulation domains.

Contribution

The paper proposes a novel CBFIRL framework that combines control barrier functions with IRL, jointly optimized via gradient descent for safer robot learning.

Findings

01

15-20% safety improvement in 2D racecar domain

02

50% safety improvement in 3D drone domain

03

Enhanced safety compared to IRL without CBF

Abstract

Learning from Demonstration (LfD) is a powerful method for enabling robots to perform novel tasks as it is often more tractable for a non-roboticist end-user to demonstrate the desired skill and for the robot to efficiently learn from the associated data than for a human to engineer a reward function for the robot to learn the skill via reinforcement learning (RL). Safety issues arise in modern LfD techniques, e.g., Inverse Reinforcement Learning (IRL), just as they do for RL; yet, safe learning in LfD has received little attention. In the context of agile robots, safety is especially vital due to the possibility of robot-environment collision, robot-human collision, and damage to the robot. In this paper, we propose a safe IRL framework, CBFIRL, that leverages the Control Barrier Function (CBF) to enhance the safety of the IRL policy. The core idea of CBFIRL is to combine a loss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Age of Information Optimization · Adversarial Robustness in Machine Learning