Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
Bavo Lesy, Ali Anwar, Siegfried Mercelis

TL;DR
This paper evaluates the robustness of deep reinforcement learning algorithms, especially Soft-Actor Critic, for autonomous inland waterway shipping, demonstrating their ability to handle environmental disturbances and generalize to new port scenarios.
Contribution
It compares model-free and model-based RL algorithms in autonomous shipping, highlighting the robustness of Soft-Actor Critic in complex, dynamic environments.
Findings
Soft-Actor Critic outperforms MuZero in robustness to disturbances
Model-free RL can navigate unseen port environments effectively
The study advances RL frameworks for diverse vessel navigation scenarios
Abstract
Recently, there has been growing interest in autonomous shipping due to its potential to improve maritime efficiency and safety. The use of advanced technologies, such as artificial intelligence, can address the current navigational and operational challenges in autonomous shipping. In particular, inland waterway transport (IWT) presents a unique set of challenges, such as crowded waterways and variable environmental conditions. In such dynamic settings, the reliability and robustness of autonomous shipping solutions are critical factors for ensuring safe operations. This paper examines the robustness of benchmark deep reinforcement learning (RL) algorithms, implemented for IWT within an autonomous shipping simulator, and their ability to generate effective motion planning policies. We demonstrate that a model-free approach can achieve an adequate policy in the simulator, successfully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Ports and Logistics · Transportation and Mobility Innovations · Maritime Navigation and Safety
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Monte-Carlo Tree Search · Batch Normalization · Prioritized Experience Replay · Average Pooling · Convolution · Residual Connection · Residual Block · MuZero · Sparse Evolutionary Training
