Multi-AUV Cooperative Target Tracking Based on Supervised Diffusion-Aided Multi-Agent Reinforcement Learning

Jiaao Ma; Chuan Lin; Guangjie Han; Shengchao Zhu; Zhenyu Wang; and Chen An

arXiv:2603.29426·cs.NI·April 1, 2026

Multi-AUV Cooperative Target Tracking Based on Supervised Diffusion-Aided Multi-Agent Reinforcement Learning

Jiaao Ma, Chuan Lin, Guangjie Han, Shengchao Zhu, Zhenyu Wang, and Chen An

PDF

TL;DR

This paper introduces a hierarchical multi-agent reinforcement learning framework with diffusion-based supervised learning to improve cooperative target tracking by autonomous underwater vehicles under challenging conditions.

Contribution

It proposes a novel SDA-MARL algorithm with structured experience replay, diffusion-guided training, and disturbance-robust policies for enhanced underwater multi-AUV coordination.

Findings

01

Achieves higher tracking precision than existing methods in simulations.

02

Effectively mitigates non-stationarity and sparse reward issues.

03

Demonstrates robustness under hydrodynamic disturbances.

Abstract

In recent years, advances in underwater networking and multi-agent reinforcement learning (MARL) have significantly expanded multi-autonomous underwater vehicle (AUV) applications in marine exploration and target tracking. However, current MARL-driven cooperative tracking faces three critical challenges: 1) non-stationarity in decentralized coordination, where local policy updates destabilize teammates' observation spaces, preventing convergence; 2) sparse-reward exploration inefficiency from limited underwater visibility and constrained sensor ranges, causing high-variance learning; and 3) water disturbance fragility combined with handcrafted reward dependency that degrades real-world robustness under unmodeled hydrodynamic conditions. To address these challenges, this paper proposes a hierarchical MARL architecture comprising four layers: global training scheduling, multi-agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.