Perturbation-mitigated USV Navigation with Distributionally Robust Reinforcement Learning
Zhaofan Zhang, Minghao Yang, Sihong Xie, Hui Xiong

TL;DR
This paper introduces DRIQN, a novel distributionally robust reinforcement learning method that enhances USV navigation safety and efficiency in noisy, unpredictable marine environments by optimizing worst-case performance.
Contribution
The paper proposes DRIQN, integrating DRO with implicit quantile networks and explicit subgroup modeling to improve robustness against heterogeneous environmental noise in USV navigation.
Findings
DRIQN achieves +13.51% success rate over state-of-the-art.
Reduces collision rate by -12.28%.
Improves time and energy efficiency significantly.
Abstract
The robustness of Unmanned Surface Vehicles (USV) is crucial when facing unknown and complex marine environments, especially when heteroscedastic observational noise poses significant challenges to sensor-based navigation tasks. Recently, Distributional Reinforcement Learning (DistRL) has shown promising results in some challenging autonomous navigation tasks without prior environmental information. However, these methods overlook situations where noise patterns vary across different environmental conditions, hindering safe navigation and disrupting the learning of value functions. To address the problem, we propose DRIQN to integrate Distributionally Robust Optimization (DRO) with implicit quantile networks to optimize worst-case performance under natural environmental conditions. Leveraging explicit subgroup modeling in the replay buffer, DRIQN incorporates heterogeneous noise sources…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety · Underwater Vehicles and Communication Systems · Reinforcement Learning in Robotics
