Optimal control barrier functions for RL based safe powertrain control

Habtamu Hailemichael; Beshah Ayalew; Andrej Ivanco

arXiv:2405.11391·eess.SY·May 21, 2024

Optimal control barrier functions for RL based safe powertrain control

Habtamu Hailemichael, Beshah Ayalew, Andrej Ivanco

PDF

TL;DR

This paper develops optimal high-order control barrier functions to enhance safety in reinforcement learning-based vehicle powertrain control, enabling safer exploration and improved control performance.

Contribution

It introduces a novel optimal high-order CBF framework that reduces conservatism and ensures safety for RL in vehicle powertrain systems.

Findings

01

RL with high-order CBF achieves higher rewards

02

No crashes during training and evaluation

03

Better driver demand accommodation

Abstract

Reinforcement learning (RL) can improve control performance by seeking to learn optimal control policies in the end-use environment for vehicles and other systems. To accomplish this, RL algorithms need to sufficiently explore the state and action spaces. This presents inherent safety risks, and applying RL on safety-critical systems like vehicle powertrain control requires safety enforcement approaches. In this paper, we seek control-barrier function (CBF)-based safety certificates that demarcate safe regions where the RL agent could optimize the control performance. In particular, we derive optimal high-order CBFs that avoid conservatism while ensuring safety for a vehicle in traffic. We demonstrate the workings of the high-order CBF with an RL agent which uses a deep actor-critic architecture to learn to optimize fuel economy and other driver accommodation metrics. We find that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.