Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation
Enrico Marchesini, Davide Corsi, Alessandro Farinelli

TL;DR
This paper introduces a new benchmark environment for safe deep reinforcement learning in aquatic navigation, combining novel training and verification strategies to improve safety and performance.
Contribution
It presents a crossover-based DRL training method and an interval analysis verification strategy, establishing a benchmark for safe aquatic navigation.
Findings
Crossover-based DRL outperforms prior approaches.
Verification quantifies property violations.
Benchmark facilitates future research in safe aquatic navigation.
Abstract
We propose a novel benchmark environment for Safe Reinforcement Learning focusing on aquatic navigation. Aquatic navigation is an extremely challenging task due to the non-stationary environment and the uncertainties of the robotic platform, hence it is crucial to consider the safety aspect of the problem, by analyzing the behavior of the trained network to avoid dangerous situations (e.g., collisions). To this end, we consider a value-based and policy-gradient Deep Reinforcement Learning (DRL) and we propose a crossover-based strategy that combines gradient-based and gradient-free DRL to improve sample-efficiency. Moreover, we propose a verification strategy based on interval analysis that checks the behavior of the trained models over a set of desired properties. Our results show that the crossover-based training outperforms prior DRL approaches, while our verification allows us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning
