Barrier Function Overrides For Non-Convex Fixed Wing Flight Control and Self-Driving Cars
Eric Squires, Phillip Odom, Zsolt Kira

TL;DR
This paper develops novel barrier functions for non-convex control systems in discrete time, enabling safe reinforcement learning in robotics applications like aircraft and autonomous cars with minimal safety violations.
Contribution
It introduces new barrier functions tailored for non-convex, discrete-time systems, providing approximate solutions that balance safety, performance, and computational efficiency.
Findings
Approximate barrier functions achieve zero safety violations.
Performance comparable to baseline RL methods.
Tradeoffs between safety, performance, and computational tractability.
Abstract
Reinforcement Learning (RL) has enabled vast performance improvements for robotics systems. To achieve these results though, the agent often must randomly explore the environment, which for safety critical systems presents a significant challenge. Barrier functions can solve this challenge by enabling an override that approximates the RL control input as closely as possible without violating a safety constraint. Unfortunately, this override can be computationally intractable in cases where the dynamics are not convex in the control input or when time is discrete, as is often the case when training RL systems. We therefore consider these cases, developing novel barrier functions for two non-convex systems (fixed wing aircraft and self-driving cars performing lane merging with adaptive cruise control) in discrete time. Although solving for an online and optimal override is in general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Adversarial Robustness in Machine Learning
