Runtime Safety Assurance Using Reinforcement Learning

Christopher Lazarus; James G. Lopez; Mykel J. Kochenderfer

arXiv:2010.10618·cs.LG·October 22, 2020

Runtime Safety Assurance Using Reinforcement Learning

Christopher Lazarus, James G. Lopez, Mykel J. Kochenderfer

PDF

TL;DR

This paper introduces a reinforcement learning-based meta-controller for Runtime Safety Assurance in autopilots, effectively balancing safety and operational efficiency without relying heavily on domain expertise.

Contribution

It presents a novel RL approach to designing a meta-controller for RTSA, outperforming traditional human-engineered methods in safety verification tasks.

Findings

01

RL-based meta-controller achieves higher safety accuracy

02

Outperforms baseline human-engineered approaches

03

Demonstrates effectiveness in complex, high-dimensional systems

Abstract

The airworthiness and safety of a non-pedigreed autopilot must be verified, but the cost to formally do so can be prohibitive. We can bypass formal verification of non-pedigreed components by incorporating Runtime Safety Assurance (RTSA) as mechanism to ensure safety. RTSA consists of a meta-controller that observes the inputs and outputs of a non-pedigreed component and verifies formally specified behavior as the system operates. When the system is triggered, a verified recovery controller is deployed. Recovery controllers are designed to be safe but very likely disruptive to the operational objective of the system, and thus RTSA systems must balance safety and efficiency. The objective of this paper is to design a meta-controller capable of identifying unsafe situations with high accuracy. High dimensional and non-linear dynamics in which modern controllers are deployed along with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.