Evaluating Model-free Reinforcement Learning toward Safety-critical   Tasks

Linrui Zhang; Qin Zhang; Li Shen; Bo Yuan; Xueqian Wang; and Dacheng Tao

arXiv:2212.05727·cs.LG·December 13, 2022

Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks

Linrui Zhang, Qin Zhang, Li Shen, Bo Yuan, Xueqian Wang, and Dacheng Tao

PDF

Open Access 1 Video

TL;DR

This paper reviews safety-critical reinforcement learning methods, introduces a novel safety layer combining optimization and projection, and provides a unified evaluation toolkit and comparative analysis across multiple benchmarks.

Contribution

It proposes Unrolling Safety Layer (USL), a new method for enforcing safety constraints, and develops SafeRL-Kit for standardized evaluation of safety RL algorithms.

Findings

01

USL effectively balances reward and safety constraints.

02

Algorithms demonstrate robustness in zero-cost-return policies.

03

Unified toolkit facilitates fair comparison of safety RL methods.

Abstract

Safety comes first in many real-world applications involving autonomous agents. Despite a large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there is still a lack of high-quality evaluation of those algorithms that adheres to safety constraints at each decision step under complex and unknown dynamics. In this paper, we revisit prior work in this scope from the perspective of state-wise safe RL and categorize them as projection-based, recovery-based, and optimization-based approaches, respectively. Furthermore, we propose Unrolling Safety Layer (USL), a joint method that combines safety optimization and safety projection. This novel technique explicitly enforces hard constraints via the deep unrolling architecture and enjoys structural advantages in navigating the trade-off between reward improvement and constraint satisfaction. To facilitate further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks· underline

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics · Safety Systems Engineering in Autonomy