A Novel Joint DRL-Based Utility Optimization for UAV Data Services
Xuli Cai, Poonam Lohan, Burak Kantarci

TL;DR
This paper introduces a joint deep reinforcement learning approach using DQN and DDPG algorithms to optimize resource allocation in UAV-assisted communication networks, significantly increasing user service capacity.
Contribution
It presents a novel combined DRL framework for dynamic bandwidth and power allocation in UAV networks, improving efficiency over traditional static methods.
Findings
Up to 41% increase in users served.
Effective adaptation to fading conditions.
Dynamic resource management outperforms static allocation.
Abstract
In this paper, we propose a novel joint deep reinforcement learning (DRL)-based solution to optimize the utility of an uncrewed aerial vehicle (UAV)-assisted communication network. To maximize the number of users served within the constraints of the UAV's limited bandwidth and power resources, we employ deep Q-Networks (DQN) and deep deterministic policy gradient (DDPG) algorithms for optimal resource allocation to ground users with heterogeneous data rate demands. The DQN algorithm dynamically allocates multiple bandwidth resource blocks to different users based on current demand and available resource states. Simultaneously, the DDPG algorithm manages power allocation, continuously adjusting power levels to adapt to varying distances and fading conditions, including Rayleigh fading for non-line-of-sight (NLoS) links and Rician fading for line-of-sight (LoS) links. Our joint DRL-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Air Traffic Management and Optimization · Vehicular Ad Hoc Networks (VANETs)
