Task-Semantic Graph-Driven Distributed Agent Networking for Underwater Target Tracking

Shengchao Zhu; Guangjie Han; Chuan Lin; and Yu He

arXiv:2605.15528·cs.RO·May 18, 2026

Task-Semantic Graph-Driven Distributed Agent Networking for Underwater Target Tracking

Shengchao Zhu, Guangjie Han, Chuan Lin, and Yu He

PDF

1 Repo

TL;DR

This paper introduces an open-source multi-agent reinforcement learning platform for underwater AUV swarm target tracking, featuring a novel semantic policy enhancement and a unified evaluation protocol.

Contribution

It develops the first open-source platform connecting MARL frameworks with physically modeled AUV swarm tasks and proposes a semantic policy method for improved coordination.

Findings

01

The platform enables fair comparison of RL algorithms in underwater scenarios.

02

STG-MAPPO improves task phase recognition and cooperation among AUVs.

03

The code is publicly available for research use.

Abstract

Autonomous underwater vehicle (AUV) swarms are emerging as intelligent underwater networks, where each node must sense, communicate, process local data, and make decisions under severe acoustic constraints. Persistent underwater target tracking is a typical task with moving targets, changing communication topology, intermittent acoustic links, and limited observation for each AUV. Multi-agent reinforcement learning (MARL) is a natural candidate for distributed tracking, yet existing studies still lack a unified open-source platform for evaluating different MARL algorithms under six-degree-of-freedom AUV dynamics. In addition, policies trained with raw geometric states and low-level force actions often struggle to represent task phases, observation reliability, link quality, and local cooperation roles. This paper addresses these issues by developing an open-source MARL-AUV platform that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dasjsaj/MARL-AUV
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.