Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents
Nathaniel Hamilton, Kyle Dunlap, Taylor T Johnson, Kerianne L Hobbs

TL;DR
This paper systematically evaluates how run time assurance methods influence the training effectiveness and safety of reinforcement learning agents, highlighting best practices for fair assessment in safe RL research.
Contribution
It provides a comprehensive ablation study on RTA techniques in RL, establishing evaluation standards and analyzing their impact on learning and safety.
Findings
Certain RTA methods improve safety without hindering learning.
Dependence on RTA varies across different RL algorithms.
Reward shaping and safe exploration are crucial for effective training.
Abstract
Reinforcement Learning (RL) has become an increasingly important research area as the success of machine learning algorithms and methods grows. To combat the safety concerns surrounding the freedom given to RL agents while training, there has been an increase in work concerning Safe Reinforcement Learning (SRL). However, these new and safe methods have been held to less scrutiny than their unsafe counterparts. For instance, comparisons among safe methods often lack fair evaluation across similar initial condition bounds and hyperparameter settings, use poor evaluation metrics, and cherry-pick the best training runs rather than averaging over multiple random seeds. In this work, we conduct an ablation study using evaluation best practices to investigate the impact of run time assurance (RTA), which monitors the system state and intervenes to assure safety, on effective learning. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Formal Methods in Verification · Autonomous Vehicle Technology and Safety
