# AoI-Aware Data Collection in Heterogeneous UAV-Assisted WSNs: Strong-Agent Coordinated Coverage and Vicsek-Driven Weak-Swarm Control

**Authors:** Lin Huang, Lanhua Li, Songhan Zhao, Daiming Qu, Jing Xu

PMC · DOI: 10.3390/s26020419 · Sensors (Basel, Switzerland) · 2026-01-08

## TL;DR

This paper proposes a hierarchical framework using UAV swarms to efficiently collect data from ground users while reducing the age of information.

## Contribution

A novel hierarchical MADRL framework combining strong-agent coordination and weak-swarm control for AoI-aware data collection in UAV-assisted WSNs.

## Key findings

- The proposed SW-MADDPG reduces AoI by 30% compared to the No-Voronoi baseline.
- AoI is reduced by 21% compared to the Heuristic-HUAV baseline.
- The framework scales well with large numbers of low-capability UAVs.

## Abstract

Unmanned aerial vehicle (UAV) swarms offer an efficient solution for data collection from widely distributed ground users (GUs). However, incomplete environment information and frequent changes make it challenging for standard centralized planning or pure reinforcement learning approaches to simultaneously maintain global solution quality and local flexibility. We propose a hierarchical data collection framework for heterogeneous UAV-assisted wireless sensor networks (WSNs). A small set of high-capability UAVs (H-UAVs), equipped with substantial computational and communication resources, coordinate regional coverage, trajectory planning, and uplink transmission control for numerous resource-constrained low-capability UAVs (L-UAVs) across power-Voronoi-partitioned areas using multi-agent deep reinforcement learning (MADRL). Specifically, we employ Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to enhance H-UAVs’ decision-making capabilities and enable coordinated actions. The partitions are dynamically updated based on GUs’ data generation rates and L-UAV density to balance workload and adapt to environmental dynamics. Concurrently, a large number of L-UAVs with limited onboard resources perform self-organized data collection from GUs and execute opportunistic relaying to a remote access point (RAP) via H-UAVs. Within each Voronoi cell, L-UAV motion follows a weighted Vicsek model that incorporates GUs’ age of information (AoI), link quality, and congestion avoidance. This spatial decomposition combined with decentralized weak-swarm control enables scalability to large-scale L-UAV deployments. Experiments demonstrate that the proposed strong and weak agent MADDPG (SW-MADDPG) scheme reduces AoI by 30% and 21% compared to No-Voronoi and Heuristic-HUAV baselines, respectively.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12845569/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12845569/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12845569/full.md

---
Source: https://tomesphere.com/paper/PMC12845569