Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation

Enrico Marchesini; Davide Corsi; Alessandro Farinelli

arXiv:2112.10593·cs.LG·December 21, 2021

Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation

Enrico Marchesini, Davide Corsi, Alessandro Farinelli

PDF

Open Access

TL;DR

This paper introduces a new benchmark environment for safe deep reinforcement learning in aquatic navigation, combining novel training and verification strategies to improve safety and performance.

Contribution

It presents a crossover-based DRL training method and an interval analysis verification strategy, establishing a benchmark for safe aquatic navigation.

Findings

01

Crossover-based DRL outperforms prior approaches.

02

Verification quantifies property violations.

03

Benchmark facilitates future research in safe aquatic navigation.

Abstract

We propose a novel benchmark environment for Safe Reinforcement Learning focusing on aquatic navigation. Aquatic navigation is an extremely challenging task due to the non-stationary environment and the uncertainties of the robotic platform, hence it is crucial to consider the safety aspect of the problem, by analyzing the behavior of the trained network to avoid dangerous situations (e.g., collisions). To this end, we consider a value-based and policy-gradient Deep Reinforcement Learning (DRL) and we propose a crossover-based strategy that combines gradient-based and gradient-free DRL to improve sample-efficiency. Moreover, we propose a verification strategy based on interval analysis that checks the behavior of the trained models over a set of desired properties. Our results show that the crossover-based training outperforms prior DRL approaches, while our verification allows us to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Adversarial Robustness in Machine Learning