Spatial-temporal recurrent reinforcement learning for autonomous ships
Martin Waltz, Ostap Okhrin

TL;DR
This paper introduces a novel spatial-temporal recurrent neural network architecture for deep Q-networks, enabling autonomous ships to navigate safely and efficiently in complex maritime environments with multiple ships and partial observability.
Contribution
It presents a new neural network design that handles multiple targets, incorporates maritime collision rules, and demonstrates robustness and compatibility in multi-agent scenarios.
Findings
Outperforms traditional collision avoidance methods in simulations.
Shows robustness in multi-ship scenarios with partial observability.
Compatible with various deep reinforcement learning algorithms.
Abstract
This paper proposes a spatial-temporal recurrent neural network architecture for deep -networks that can be used to steer an autonomous ship. The network design makes it possible to handle an arbitrary number of surrounding target ships while offering robustness to partial observability. Furthermore, a state-of-the-art collision risk metric is proposed to enable an easier assessment of different situations by the agent. The COLREG rules of maritime traffic are explicitly considered in the design of the reward function. The final policy is validated on a custom set of newly created single-ship encounters called `Around the Clock' problems and the commonly used Imazu (1987) problems, which include 18 multi-ship scenarios. Performance comparisons with artificial potential field and velocity obstacle methods demonstrate the potential of the proposed approach for maritime path planning.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety · Infrastructure Resilience and Vulnerability Analysis · Autonomous Vehicle Technology and Safety
