A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks
Murad Dawood, Ahmed Shokry, Maren Bennewitz

TL;DR
This paper introduces a novel safety shield for reinforcement learning in navigation tasks that enhances exploration, reduces collisions, and improves goal achievement by combining optimization-based control with RL adaptability.
Contribution
It proposes a dynamic safety shield that adaptively tunes controller parameters, balancing safety and exploration in RL for navigation, outperforming existing safety methods.
Findings
Outperforms state-of-the-art baselines in simulation environments.
Achieves higher goals-to-collisions ratio than traditional safety shields.
Demonstrates effectiveness in real-world experiments.
Abstract
Reinforcement learning (RL) has been successfully applied to a variety of robotics applications, where it outperforms classical methods. However, the safety aspect of RL and the transfer to the real world remain an open challenge. A prominent field for tackling this challenge and ensuring the safety of the agents during training and execution is safe reinforcement learning. Safe RL can be achieved through constrained RL and safe exploration approaches. The former learns the safety constraints over the course of training to achieve a safe behavior by the end of training, at the cost of high number of collisions at earlier stages of the training. The latter offers robust safety by enforcing the safety constraints as hard constraints, which prevents collisions but hinders the exploration of the RL agent, resulting in lower rewards and poor performance. To overcome those drawbacks, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Autonomous Vehicle Technology and Safety · Occupational Health and Safety Research
