Safety Aware Reinforcement Learning (SARL)
Santiago Miret, Somdeb Majumdar, Carroll Wainwright

TL;DR
This paper introduces SARL, a framework where a virtual safe agent learns a task-independent safety policy to modulate various primary agents, reducing side effects across different tasks without additional training.
Contribution
SARL proposes a task-independent safety modulation mechanism that can be applied to multiple policies, enhancing safety and generalizability in reinforcement learning environments.
Findings
SARL effectively minimizes side effects in complex dynamic environments.
The safety agent's modulation matches task-specific solutions in performance.
SARL demonstrates portability across different tasks without retraining.
Abstract
As reinforcement learning agents become increasingly integrated into complex, real-world environments, designing for safety becomes a critical consideration. We specifically focus on researching scenarios where agents can cause undesired side effects while executing a policy on a primary task. Since one can define multiple tasks for a given environment dynamics, there are two important challenges. First, we need to abstract the concept of safety that applies broadly to that environment independent of the specific task being executed. Second, we need a mechanism for the abstracted notion of safety to modulate the actions of agents executing different policies to minimize their side-effects. In this work, we propose Safety Aware Reinforcement Learning (SARL) - a framework where a virtual safe agent modulates the actions of a main reward-based agent to minimize side effects. The safe agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety
