SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian, Manel Abdellatif, Lionel C. Briand, and Ramesh S

TL;DR
SMARLA is a black-box safety monitoring system for deep reinforcement learning agents that predicts safety violations early using state abstraction and machine learning, improving safety in critical applications.
Contribution
Introduces SMARLA, a novel safety monitoring approach for DRL agents that predicts violations early using Q-values and state abstraction, with validated effectiveness.
Findings
Accurately predicts safety violations with low false positives.
Detects violations approximately halfway through agent execution.
Effective across multiple DRL case studies.
Abstract
Deep Reinforcement Learning (DRL) has made significant advancements in various fields, such as autonomous driving, healthcare, and robotics, by enabling agents to learn optimal policies through interactions with their environments. However, the application of DRL in safety-critical domains presents challenges, particularly concerning the safety of the learned policies. DRL agents, which are focused on maximizing rewards, may select unsafe actions, leading to safety violations. Runtime safety monitoring is thus essential to ensure the safe operation of these agents, especially in unpredictable and dynamic environments. This paper introduces SMARLA, a black-box safety monitoring approach specifically designed for DRL agents. SMARLA utilizes machine learning to predict safety violations by observing the agent's behavior during execution. The approach is based on Q-values, which reflect the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Smart Grid Security and Resilience
