Safe Reinforcement Learning with Probabilistic Guarantees Satisfying   Temporal Logic Specifications in Continuous Action Spaces

Hanna Krasowski; Prithvi Akella; Aaron D. Ames; Matthias Althoff

arXiv:2212.06129·cs.RO·September 29, 2023

Safe Reinforcement Learning with Probabilistic Guarantees Satisfying Temporal Logic Specifications in Continuous Action Spaces

Hanna Krasowski, Prithvi Akella, Aaron D. Ames, Matthias Althoff

PDF

Open Access

TL;DR

This paper introduces a safe reinforcement learning framework for continuous spaces that guarantees probabilistic safety with respect to temporal logic specifications by combining verification and performance optimization steps.

Contribution

It presents a novel three-step method that separates safety verification from performance optimization, enabling explicit probabilistic safety guarantees in continuous action spaces.

Findings

01

Efficient learning in a robot evasion task.

02

Maintains safety guarantees during learning.

03

Improves performance while ensuring safety.

Abstract

Vanilla Reinforcement Learning (RL) can efficiently solve complex tasks but does not provide any guarantees on system behavior. To bridge this gap, we propose a three-step safe RL procedure for continuous action spaces that provides probabilistic guarantees with respect to temporal logic specifications. First, our approach probabilistically verifies a candidate controller with respect to a temporal logic specification while randomizing the control inputs to the system within a bounded set. Second, we improve the performance of this probabilistically verified controller by adding an RL agent that optimizes the verified controller for performance in the same bounded set around the control input. Third, we verify probabilistic safety guarantees with respect to temporal logic specifications for the learned agent. Our approach is efficiently implementable for continuous action and state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Safety Systems Engineering in Autonomy · Formal Methods in Verification