Multiple Ships Cooperative Navigation and Collision Avoidance using Multi-agent Reinforcement Learning with Communication
Y. Wang, Y. Zhao

TL;DR
This paper introduces a multi-agent reinforcement learning framework using MADDPG with communication for cooperative navigation and collision avoidance among ships, effectively handling partial observability and improving coordination.
Contribution
It applies MADDPG with communication to multi-ship tasks, demonstrating improved cooperation and communication protocols under partial observability.
Findings
Outperforms traditional single-agent algorithms
Agents develop effective communication protocols
Handles external noise and partial observability effectively
Abstract
In the real world, unmanned surface vehicles (USV) often need to coordinate with each other to accomplish specific tasks. However, achieving cooperative control in multi-agent systems is challenging due to issues such as non-stationarity and partial observability. Recent advancements in Multi-Agent Reinforcement Learning (MARL) provide new perspectives to address these challenges. Therefore, we propose using the multi-agent deep deterministic policy gradient (MADDPG) algorithm with communication to address multiple ships' cooperation problems under partial observability. We developed two tasks based on OpenAI's gym environment: cooperative navigation and cooperative collision avoidance. In these tasks, ships must not only learn effective control strategies but also establish communication protocols with other agents. We analyze the impact of external noise on communication, the effect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMaritime Navigation and Safety · Maritime Ports and Logistics · Maritime Security and History
