Synthesis and Deployment of Maximal Robust Control Barrier Functions through Adversarial Reinforcement Learning
Donggeon David Oh, Duy P. Nguyen, Haimin Hu, Jaime Fern\'andez Fisac

TL;DR
This paper introduces a novel reinforcement learning-based framework for synthesizing robust control barrier functions applicable to general nonlinear systems with black-box dynamics, enhancing safety guarantees under uncertainty.
Contribution
It develops a new robust CBF framework using Q-functions and adversarial RL, enabling safety enforcement without explicit dynamics models.
Findings
Achieved less conservative safe sets on inverted pendulum benchmark.
Demonstrated reliable safety enforcement on a 36-D quadruped simulator.
Validated the approach's effectiveness under adversarial uncertainty.
Abstract
Robust control barrier functions (CBFs) provide a principled mechanism for smooth safety enforcement under worst-case disturbances. However, existing approaches typically rely on explicit, closed-form structure in the dynamics (e.g., control-affine) and uncertainty models. This has led to limited scalability and generality, with most robust CBFs certifying only conservative subsets of the maximal robust safe set. In this paper, we introduce a new robust CBF framework for general nonlinear systems under bounded uncertainty. We first show that the safety value function solving the dynamic programming Isaacs equation is a valid robust discrete-time CBF that enforces safety on the maximal robust safe set. We then adopt the key reinforcement learning (RL) notion of quality function (or Q-function), which removes the need for explicit dynamics by lifting the barrier certificate into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
