Optimal control barrier functions for RL based safe powertrain control
Habtamu Hailemichael, Beshah Ayalew, Andrej Ivanco

TL;DR
This paper develops optimal high-order control barrier functions to enhance safety in reinforcement learning-based vehicle powertrain control, enabling safer exploration and improved control performance.
Contribution
It introduces a novel optimal high-order CBF framework that reduces conservatism and ensures safety for RL in vehicle powertrain systems.
Findings
RL with high-order CBF achieves higher rewards
No crashes during training and evaluation
Better driver demand accommodation
Abstract
Reinforcement learning (RL) can improve control performance by seeking to learn optimal control policies in the end-use environment for vehicles and other systems. To accomplish this, RL algorithms need to sufficiently explore the state and action spaces. This presents inherent safety risks, and applying RL on safety-critical systems like vehicle powertrain control requires safety enforcement approaches. In this paper, we seek control-barrier function (CBF)-based safety certificates that demarcate safe regions where the RL agent could optimize the control performance. In particular, we derive optimal high-order CBFs that avoid conservatism while ensuring safety for a vehicle in traffic. We demonstrate the workings of the high-order CBF with an RL agent which uses a deep actor-critic architecture to learn to optimize fuel economy and other driver accommodation metrics. We find that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
