Evaluating Robustness of Reinforcement Learning Algorithms for   Autonomous Shipping

Bavo Lesy; Ali Anwar; Siegfried Mercelis

arXiv:2411.04915·cs.LG·November 8, 2024

Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping

Bavo Lesy, Ali Anwar, Siegfried Mercelis

PDF

Open Access

TL;DR

This paper evaluates the robustness of deep reinforcement learning algorithms, especially Soft-Actor Critic, for autonomous inland waterway shipping, demonstrating their ability to handle environmental disturbances and generalize to new port scenarios.

Contribution

It compares model-free and model-based RL algorithms in autonomous shipping, highlighting the robustness of Soft-Actor Critic in complex, dynamic environments.

Findings

01

Soft-Actor Critic outperforms MuZero in robustness to disturbances

02

Model-free RL can navigate unseen port environments effectively

03

The study advances RL frameworks for diverse vessel navigation scenarios

Abstract

Recently, there has been growing interest in autonomous shipping due to its potential to improve maritime efficiency and safety. The use of advanced technologies, such as artificial intelligence, can address the current navigational and operational challenges in autonomous shipping. In particular, inland waterway transport (IWT) presents a unique set of challenges, such as crowded waterways and variable environmental conditions. In such dynamic settings, the reliability and robustness of autonomous shipping solutions are critical factors for ensuring safe operations. This paper examines the robustness of benchmark deep reinforcement learning (RL) algorithms, implemented for IWT within an autonomous shipping simulator, and their ability to generate effective motion planning policies. We demonstrate that a model-free approach can achieve an adequate policy in the simulator, successfully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMaritime Ports and Logistics · Transportation and Mobility Innovations · Maritime Navigation and Safety

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Monte-Carlo Tree Search · Batch Normalization · Prioritized Experience Replay · Average Pooling · Convolution · Residual Connection · Residual Block · MuZero · Sparse Evolutionary Training