Responsive Regulation of Dynamic UAV Communication Networks Based on Deep Reinforcement Learning
Ran Zhang, Duc Minh (Aaron) Nguyen, Miao Wang, Lin X. Cai, Xuemin, (Sherman) Shen

TL;DR
This paper presents a deep reinforcement learning framework for proactively managing UAV communication networks, enabling UAVs to anticipate and adapt to dynamic changes in UAV lineup and user distribution for improved user satisfaction.
Contribution
It introduces a novel DRL-based control framework with a state transition design and an asynchronous DDPG algorithm to handle dynamic UAV network environments proactively.
Findings
The proposed method converges reliably in simulations.
It outperforms passive reaction strategies in dynamic scenarios.
The framework effectively manages changes in UAV lineup and user distribution.
Abstract
In this chapter, the regulation of Unmanned Aerial Vehicle (UAV) communication network is investigated in the presence of dynamic changes in the UAV lineup and user distribution. We target an optimal UAV control policy which is capable of identifying the upcoming change in the UAV lineup (quit or join-in) or user distribution, and proactively relocating the UAVs ahead of the change rather than passively dispatching the UAVs after the change. Specifically, a deep reinforcement learning (DRL)-based UAV control framework is developed to maximize the accumulated user satisfaction (US) score for a given time horizon which is able to handle the change in both the UAV lineup and user distribution. The framework accommodates the changed dimension of the state-action space before and after the UAV lineup change by deliberate state transition design. In addition, to handle the continuous state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Distributed Control Multi-Agent Systems · Advanced MIMO Systems Optimization
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Weight Decay · Adam · Experience Replay · Convolution · Batch Normalization · Dense Connections · Deep Deterministic Policy Gradient
