Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
Linrui Zhang, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang, and Dacheng Tao

TL;DR
This paper reviews safety-critical reinforcement learning methods, introduces a novel safety layer combining optimization and projection, and provides a unified evaluation toolkit and comparative analysis across multiple benchmarks.
Contribution
It proposes Unrolling Safety Layer (USL), a new method for enforcing safety constraints, and develops SafeRL-Kit for standardized evaluation of safety RL algorithms.
Findings
USL effectively balances reward and safety constraints.
Algorithms demonstrate robustness in zero-cost-return policies.
Unified toolkit facilitates fair comparison of safety RL methods.
Abstract
Safety comes first in many real-world applications involving autonomous agents. Despite a large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there is still a lack of high-quality evaluation of those algorithms that adheres to safety constraints at each decision step under complex and unknown dynamics. In this paper, we revisit prior work in this scope from the perspective of state-wise safe RL and categorize them as projection-based, recovery-based, and optimization-based approaches, respectively. Furthermore, we propose Unrolling Safety Layer (USL), a joint method that combines safety optimization and safety projection. This novel technique explicitly enforces hard constraints via the deep unrolling architecture and enjoys structural advantages in navigating the trade-off between reward improvement and constraint satisfaction. To facilitate further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Safety Systems Engineering in Autonomy
