SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents

Amirhossein Zolfagharian, Manel Abdellatif, Lionel C. Briand, and Ramesh S

arXiv:2308.02594 · cs.LG · February 6, 2026 · 2 cites

TL;DR

SMARLA is a black-box safety monitoring approach for deep reinforcement learning agents. It combines state abstraction with machine learning to predict safety violations early during agent execution.

Contribution

Introduces SMARLA, a safety monitoring approach for DRL agents that predicts violations early using Q-values and state abstraction, and evaluates it on multiple DRL case studies.
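To make the idea of monitoring via Q-values and state abstraction concrete, here is a minimal sketch, not the authors' implementation. It assumes that states are abstracted by binning each Q-value with a hypothetical width `d`, and it swaps the machine learning model for a simple per-abstract-state risk table. The names `abstract_state`, `fit_risk_table`, and `monitor`, and the toy data, are illustrative only.

```python
import math
from collections import defaultdict


def abstract_state(q_values, d=0.5):
    """Map a vector of Q-values to an abstract state by binning each value
    with width d (an assumed abstraction, chosen for illustration)."""
    return tuple(math.floor(q / d) for q in q_values)


def fit_risk_table(episodes, labels, d=0.5):
    """For each abstract state, estimate the fraction of training episodes
    that visited it and ended in a violation. This frequency table is a
    stand-in for a trained classifier."""
    seen = defaultdict(int)
    unsafe = defaultdict(int)
    for episode, violated in zip(episodes, labels):
        # Count each abstract state at most once per episode.
        for s in {abstract_state(q, d) for q in episode}:
            seen[s] += 1
            unsafe[s] += int(violated)
    return {s: unsafe[s] / seen[s] for s in seen}


def monitor(episode, risk, threshold=0.8, d=0.5):
    """Return the first time step whose abstract state has estimated risk
    at or above the threshold, or None if no step raises an alarm."""
    for t, q in enumerate(episode):
        if risk.get(abstract_state(q, d), 0.0) >= threshold:
            return t
    return None


# Toy data: each episode is a list of per-step Q-value vectors (two actions).
safe_ep = [[1.0, 1.2], [1.1, 1.3], [1.4, 1.6]]
unsafe_ep = [[1.0, 1.2], [0.2, 0.1], [-0.6, -0.9]]
risk = fit_risk_table([safe_ep, unsafe_ep], [False, True])

# A new execution is flagged as soon as it enters a high-risk abstract state.
alarm_step = monitor([[1.05, 1.1], [0.3, 0.4], [-0.7, -0.8]], risk)
```

In this toy run the alarm fires before the episode ends, which is the kind of early prediction the monitor aims for. A real monitor would need many more training episodes and a proper classifier over the abstract-state features.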

Findings

01. Accurately predicts safety violations with low false positives.
02. Detects violations approximately halfway through agent execution.
03. Effective across multiple DRL case studies.

Abstract

Deep Reinforcement Learning (DRL) has made significant advancements in various fields, such as autonomous driving, healthcare, and robotics, by enabling agents to learn optimal policies through interactions with their environments. However, the application of DRL in safety-critical domains presents challenges, particularly concerning the safety of the learned policies. DRL agents, which are focused on maximizing rewards, may select unsafe actions, leading to safety violations. Runtime safety monitoring is thus essential to ensure the safe operation of these agents, especially in unpredictable and dynamic environments. This paper introduces SMARLA, a black-box safety monitoring approach specifically designed for DRL agents. SMARLA utilizes machine learning to predict safety violations by observing the agent's behavior during execution. The approach is based on Q-values, which reflect the…

Code & Models

Repositories

amirhosseinzlf/smarla (tf, official)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Smart Grid Security and Resilience