Dynamic Trajectory and Power Control in Ultra-Dense UAV Networks: A Mean-Field Reinforcement Learning Approach
Fei Song, Zhe Wang, Jun Li, Long Shi, Wen Chen, Shi Jin

TL;DR
This paper introduces a mean-field reinforcement learning approach for dynamic trajectory and power control in ultra-dense UAV networks, effectively managing resources and interference with scalable algorithms.
Contribution
It formulates a mean-field game model for UAV resource management and develops a novel ME-MFDQN algorithm for scalable, efficient control in large UAV networks.
Findings
Improves energy efficiency over benchmark algorithms.
Performance gains with higher GUs' demand correlation.
Enhanced results with wider UAV observation capabilities.
Abstract
In ultra-dense unmanned aerial vehicle (UAV) networks, it is challenging to coordinate the resource allocation and interference management among large-scale UAVs, for providing flexible and efficient service coverage to the ground users (GUs). In this paper, we propose a learning-based resource allocation scheme in an ultra-dense UAV communication network, where the GUs' service demands are time-varying with unknown distributions. We formulate the non-cooperative game among multiple co-channel UAVs as a stochastic game, where each UAV jointly optimizes its trajectory, user association, and downlink power control to maximize the expectation of its locally cumulative energy efficiency under the interference and energy constraints. To cope with the scalability issue in a large-scale network, we further formulate the problem as a mean-field game (MFG), which simplifies the interactions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · UAV Applications and Optimization · Adaptive Dynamic Programming Control
Methodstravel james
