Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models

Yanggang Xu; Jirong Zha; Weijie Hong; Xiangmin Yi; Geng Chen; Jianfeng Zheng; Chen-Chun Hsia; Xinlei Chen

arXiv:2505.08448·cs.MA·March 19, 2026

Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models

Yanggang Xu, Jirong Zha, Weijie Hong, Xiangmin Yi, Geng Chen, Jianfeng Zheng, Chen-Chun Hsia, Xinlei Chen

PDF

TL;DR

This paper introduces MRLMN, a scalable framework combining multi-agent reinforcement learning and large language models to optimize UAV multi-hop networks in disaster scenarios, improving coverage and robustness.

Contribution

The paper presents a novel integration of LLMs with MARL for UAV networking, including a grouping strategy, reward decomposition, behavioral constraints, and knowledge distillation techniques.

Findings

01

Significant performance improvements over MAPPO baseline.

02

Enhanced network coverage and communication quality.

03

Effective scalability in large dynamic environments.

Abstract

In disaster scenarios, establishing robust emergency communication networks is critical, and unmanned aerial vehicles (UAVs) offer a promising solution to rapidly restore connectivity. However, organizing UAVs to form multi-hop networks in large-scale dynamic environments presents significant challenges, including limitations in algorithmic scalability and the vast exploration space required for coordinated decision-making. To address these issues, we propose MRLMN, a novel framework that integrates multi-agent reinforcement learning (MARL) and large language models (LLMs) to jointly optimize UAV agents toward achieving optimal networking performance. The framework incorporates a grouping strategy with reward decomposition to enhance algorithmic scalability and balance decision-making across UAVs. In addition, behavioral constraints are applied to selected key UAVs to improve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.