A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun, Wang, Alois Knoll

TL;DR
This paper reviews the current state of safe reinforcement learning, discussing methods, theories, applications, and benchmarks, and highlights key challenges to guide future research in deploying safe RL in real-world scenarios.
Contribution
It provides a comprehensive overview of safe RL from multiple perspectives, introduces the '2H3W' framework for deployment challenges, and releases an open-source repository of algorithms.
Findings
Identifies five crucial problems for safe RL deployment.
Analyzes sample complexity and theoretical progress.
Provides benchmarks and open-source implementations.
Abstract
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-making tasks. However, safety concerns are raised during deploying RL in real-world applications, leading to a growing demand for safe RL algorithms, such as in autonomous driving and robotics scenarios. While safe control has a long history, the study of safe RL algorithms is still in the early stages. To establish a good foundation for future safe RL research, in this paper, we provide a review of safe RL from the perspectives of methods, theories, and applications. Firstly, we review the progress of safe RL from five dimensions and come up with five crucial problems for safe RL being deployed in real-world applications, coined as "2H3W". Secondly, we analyze the algorithm and theory progress from the perspectives of answering the "2H3W" problems. Particularly, the sample complexity of safe RL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSafety Systems Engineering in Autonomy · Traffic control and management · Software Reliability and Analysis Research
