Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement   Learning

Davide Corsi; Davide Camponogara; Alessandro Farinelli

arXiv:2405.20534·cs.LG·June 3, 2024·2 cites

Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning

Davide Corsi, Davide Camponogara, Alessandro Farinelli

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new aquatic navigation benchmark for deep reinforcement learning, highlighting its challenges and proposing advanced training techniques to improve policy reliability and safety in complex environments.

Contribution

It presents a novel aquatic navigation environment for DRL, demonstrating its difficulty and proposing curriculum learning and hyperparameter tuning to enhance performance.

Findings

01

State-of-the-art DRL struggles with generalization and safety in aquatic navigation.

02

Advanced training techniques improve policy reliability.

03

Benchmark environment and baselines are publicly available.

Abstract

An exciting and promising frontier for Deep Reinforcement Learning (DRL) is its application to real-world robotic systems. While modern DRL approaches achieved remarkable successes in many robotic scenarios (including mobile robotics, surgical assistance, and autonomous driving) unpredictable and non-stationary environments can pose critical challenges to such methods. These features can significantly undermine fundamental requirements for a successful training process, such as the Markovian properties of the transition model. To address this challenge, we propose a new benchmarking environment for aquatic navigation using recent advances in the integration between game engines and DRL. In more detail, we show that our benchmarking environment is problematic even for state-of-the-art DRL approaches that may struggle to generate reliable policies in terms of generalization power and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dadecampo/SafeRLAUV
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUnderwater Vehicles and Communication Systems

MethodsFocus · Entropy Regularization · Proximal Policy Optimization