Anomalous State Sequence Modeling to Enhance Safety in Reinforcement   Learning

Leen Kweider; Maissa Abou Kassem; Ubai Sandouk

arXiv:2407.19860·cs.LG·November 6, 2024

Anomalous State Sequence Modeling to Enhance Safety in Reinforcement Learning

Leen Kweider, Maissa Abou Kassem, Ubai Sandouk

PDF

Open Access

TL;DR

This paper introduces AnoSeqs, a novel safe reinforcement learning method that uses anomaly detection on state sequences to improve safety in high-stakes environments like self-driving cars.

Contribution

It presents a two-stage approach combining offline safe sequence collection and anomaly detection to guide risk-averse RL policy training in safety-critical settings.

Findings

01

Successfully learned safer policies in multiple benchmarks

02

Effective anomaly detection provided supervisory signals for safety

03

Improved safety performance in self-driving car simulations

Abstract

The deployment of artificial intelligence (AI) in decision-making applications requires ensuring an appropriate level of safety and reliability, particularly in changing environments that contain a large number of unknown observations. To address this challenge, we propose a novel safe reinforcement learning (RL) approach that utilizes an anomalous state sequence to enhance RL safety. Our proposed solution Safe Reinforcement Learning with Anomalous State Sequences (AnoSeqs) consists of two stages. First, we train an agent in a non-safety-critical offline 'source' environment to collect safe state sequences. Next, we use these safe sequences to build an anomaly detection model that can detect potentially unsafe state sequences in a 'target' safety-critical environment where failures can have high costs. The estimated risk from the anomaly detection model is utilized to train a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection