Perturbation-mitigated USV Navigation with Distributionally Robust Reinforcement Learning

Zhaofan Zhang; Minghao Yang; Sihong Xie; Hui Xiong

arXiv:2512.00030·cs.RO·December 2, 2025

Perturbation-mitigated USV Navigation with Distributionally Robust Reinforcement Learning

Zhaofan Zhang, Minghao Yang, Sihong Xie, Hui Xiong

PDF

Open Access

TL;DR

This paper introduces DRIQN, a novel distributionally robust reinforcement learning method that enhances USV navigation safety and efficiency in noisy, unpredictable marine environments by optimizing worst-case performance.

Contribution

The paper proposes DRIQN, integrating DRO with implicit quantile networks and explicit subgroup modeling to improve robustness against heterogeneous environmental noise in USV navigation.

Findings

01

DRIQN achieves +13.51% success rate over state-of-the-art.

02

Reduces collision rate by -12.28%.

03

Improves time and energy efficiency significantly.

Abstract

The robustness of Unmanned Surface Vehicles (USV) is crucial when facing unknown and complex marine environments, especially when heteroscedastic observational noise poses significant challenges to sensor-based navigation tasks. Recently, Distributional Reinforcement Learning (DistRL) has shown promising results in some challenging autonomous navigation tasks without prior environmental information. However, these methods overlook situations where noise patterns vary across different environmental conditions, hindering safe navigation and disrupting the learning of value functions. To address the problem, we propose DRIQN to integrate Distributionally Robust Optimization (DRO) with implicit quantile networks to optimize worst-case performance under natural environmental conditions. Leveraging explicit subgroup modeling in the replay buffer, DRIQN incorporates heterogeneous noise sources…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMaritime Navigation and Safety · Underwater Vehicles and Communication Systems · Reinforcement Learning in Robotics