Runtime Safety Assurance Using Reinforcement Learning
Christopher Lazarus, James G. Lopez, Mykel J. Kochenderfer

TL;DR
This paper introduces a reinforcement learning-based meta-controller for Runtime Safety Assurance in autopilots, effectively balancing safety and operational efficiency without relying heavily on domain expertise.
Contribution
It presents a novel RL approach to designing a meta-controller for RTSA, outperforming traditional human-engineered methods in safety verification tasks.
Findings
RL-based meta-controller achieves higher safety accuracy
Outperforms baseline human-engineered approaches
Demonstrates effectiveness in complex, high-dimensional systems
Abstract
The airworthiness and safety of a non-pedigreed autopilot must be verified, but the cost to formally do so can be prohibitive. We can bypass formal verification of non-pedigreed components by incorporating Runtime Safety Assurance (RTSA) as mechanism to ensure safety. RTSA consists of a meta-controller that observes the inputs and outputs of a non-pedigreed component and verifies formally specified behavior as the system operates. When the system is triggered, a verified recovery controller is deployed. Recovery controllers are designed to be safe but very likely disruptive to the operational objective of the system, and thus RTSA systems must balance safety and efficiency. The objective of this paper is to design a meta-controller capable of identifying unsafe situations with high accuracy. High dimensional and non-linear dynamics in which modern controllers are deployed along with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
