Tolerance of Reinforcement Learning Controllers against Deviations in   Cyber Physical Systems

Changjian Zhang; Parv Kapoor; Eunsuk Kang; Romulo Meira-Goes; David; Garlan; Akila Ganlath; Shatadal Mishra; Nejib Ammar

arXiv:2406.17066·eess.SY·June 26, 2024

Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems

Changjian Zhang, Parv Kapoor, Eunsuk Kang, Romulo Meira-Goes, David, Garlan, Akila Ganlath, Shatadal Mishra, Nejib Ammar

PDF

Open Access

TL;DR

This paper introduces a new notion of system tolerance for RL-controlled cyber-physical systems, proposing a framework and heuristic to identify small deviations that violate system requirements, enhancing safety analysis.

Contribution

It presents a novel, expressive definition of tolerance, a new analysis problem called tolerance falsification, and a simulation-based framework with heuristics for finding small violations.

Findings

01

Effective identification of small tolerance violations

02

Framework applicable to various uncertainties and disturbances

03

Demonstrated success on benchmark problems

Abstract

Cyber-physical systems (CPS) with reinforcement learning (RL)-based controllers are increasingly being deployed in complex physical environments such as autonomous vehicles, the Internet-of-Things(IoT), and smart cities. An important property of a CPS is tolerance; i.e., its ability to function safely under possible disturbances and uncertainties in the actual operation. In this paper, we introduce a new, expressive notion of tolerance that describes how well a controller is capable of satisfying a desired system requirement, specified using Signal Temporal Logic (STL), under possible deviations in the system. Based on this definition, we propose a novel analysis problem, called the tolerance falsification problem, which involves finding small deviations that result in a violation of the given requirement. We present a novel, two-layer simulation-based analysis framework and a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Research in Systems and Signal Processing

MethodsSparse Evolutionary Training