Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation

Zamirddine Mari; Mohamad Motasem Nawaf; Pierre Drap

arXiv:2512.10925·cs.LG·December 12, 2025

Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation

Zamirddine Mari, Mohamad Motasem Nawaf, Pierre Drap

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning framework using PPO for autonomous underwater navigation, demonstrating improved obstacle avoidance and transferability from simulation to real-world BlueROV2 operations.

Contribution

It introduces a novel PPO-based deep RL approach with a combined observation space for underwater navigation, validated in simulation and real-world BlueROV2 tests.

Findings

01

PPO policy outperforms DWA in cluttered environments

02

Deep RL approach reduces collision rates

03

Successful transfer from simulation to real-world BlueROV2

Abstract

Autonomous navigation in underwater environments remains a major challenge due to the absence of GPS, degraded visibility, and the presence of submerged obstacles. This article investigates these issues through the case of the BlueROV2, an open platform widely used for scientific experimentation. We propose a deep reinforcement learning approach based on the Proximal Policy Optimization (PPO) algorithm, using an observation space that combines target-oriented navigation information, a virtual occupancy grid, and ray-casting along the boundaries of the operational area. The learned policy is compared against a reference deterministic kinematic planner, the Dynamic Window Approach (DWA), commonly employed as a robust baseline for obstacle avoidance. The evaluation is conducted in a realistic simulation environment and complemented by validation on a physical BlueROV2 supervised by a 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUnderwater Vehicles and Communication Systems · Robotic Path Planning Algorithms · Reinforcement Learning in Robotics