Learning Conservative Neural Control Barrier Functions from Offline Data
Ihab Tabbara, Hussein Sibai

TL;DR
This paper introduces an offline learning algorithm for neural control barrier functions, called CCBFs, which enhance safety in dynamical systems by preventing unsafe and out-of-distribution states, outperforming existing methods.
Contribution
The paper proposes a novel offline training algorithm for neural control barrier functions that improve safety and reliability in control systems, inspired by Conservative Q-learning.
Findings
CCBFs outperform existing safety methods in experiments
CCBFs effectively prevent reaching unsafe states
CCBFs maintain task performance with minimal safety compromise
Abstract
Safety filters, particularly those based on control barrier functions, have gained increased interest as effective tools for safe control of dynamical systems. Existing correct-by-construction synthesis algorithms for such filters, however, suffer from the curse-of-dimensionality. Deep learning approaches have been proposed in recent years to address this challenge. In this paper, we add to this set of approaches an algorithm for training neural control barrier functions from offline datasets. Such functions can be used to design constraints for quadratic programs that are then used as safety filters. Our algorithm trains these functions so that the system is not only prevented from reaching unsafe states but is also disincentivized from reaching out-of-distribution ones, at which they would be less reliable. It is inspired by Conservative Q-learning, an offline reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Formal Methods in Verification
