When Learning Meets Dynamics: Distributed User Connectivity Maximization in UAV-Based Communication Networks
Bowei Li, Saugat Tripathi, Salman Hosain, Ran Zhang, Jiang (Linda), Xie, Miao Wang

TL;DR
This paper introduces a distributed deep reinforcement learning framework for maximizing user connectivity in UAV-based communication networks, effectively handling network dynamics and UAV mobility through novel algorithms and information exchange strategies.
Contribution
It proposes DUCM-1 and DUCM-2 algorithms within a multi-agent deep Q learning framework, addressing the challenges of dynamics and information exchange in UAV networks.
Findings
Exchanging state information improves convergence.
DUCM-2 effectively manages UAV dynamics.
The proposed methods outperform baseline approaches.
Abstract
Distributed management over Unmanned Aerial Vehicle (UAV) based communication networks (UCNs) has attracted increasing research attention. In this work, we study a distributed user connectivity maximization problem in a UCN. The work features a horizontal study over different levels of information exchange during the distributed iteration and a consideration of dynamics in UAV set and user distribution, which are not well addressed in the existing works. Specifically, the studied problem is first formulated into a time-coupled mixed-integer non-convex optimization problem. A heuristic two-stage UAV-user association policy is proposed to faster determine the user connectivity. To tackle the NP-hard problem in scalable manner, the distributed user connectivity maximization algorithm 1 (DUCM-1) is proposed under the multi-agent deep Q learning (MA-DQL) framework. DUCM-1 emphasizes on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Distributed Control Multi-Agent Systems · Opportunistic and Delay-Tolerant Networks
