TL;DR
This paper introduces an open-source multi-agent reinforcement learning platform for underwater AUV swarm target tracking, featuring a novel semantic policy enhancement and a unified evaluation protocol.
Contribution
It develops the first open-source platform connecting MARL frameworks with physically modeled AUV swarm tasks and proposes a semantic policy method for improved coordination.
Findings
The platform enables fair comparison of RL algorithms in underwater scenarios.
STG-MAPPO improves task phase recognition and cooperation among AUVs.
The code is publicly available for research use.
Abstract
Autonomous underwater vehicle (AUV) swarms are emerging as intelligent underwater networks, where each node must sense, communicate, process local data, and make decisions under severe acoustic constraints. Persistent underwater target tracking is a typical task with moving targets, changing communication topology, intermittent acoustic links, and limited observation for each AUV. Multi-agent reinforcement learning (MARL) is a natural candidate for distributed tracking, yet existing studies still lack a unified open-source platform for evaluating different MARL algorithms under six-degree-of-freedom AUV dynamics. In addition, policies trained with raw geometric states and low-level force actions often struggle to represent task phases, observation reliability, link quality, and local cooperation roles. This paper addresses these issues by developing an open-source MARL-AUV platform that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
