Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks
Ming Zhu, Xiao-Yang Liu, and Anwar Walid

TL;DR
This paper applies deep reinforcement learning, specifically DDPG, to optimize UAV-assisted vehicular networks by jointly controlling UAV flight and communication to maximize throughput and energy efficiency.
Contribution
It introduces a novel DDPG-based approach for joint UAV control and communication optimization in vehicular networks, considering energy constraints and mobility prediction.
Findings
Proposed algorithms outperform baseline schemes in throughput.
Modified DDPG framework adapts to UAV mobility and energy considerations.
Verification confirms near-optimal performance in simplified models.
Abstract
Unmanned aerial vehicles (UAVs) are envisioned to complement the 5G communication infrastructure in future smart cities. Hot spots easily appear in road intersections, where effective communication among vehicles is challenging. UAVs may serve as relays with the advantages of low price, easy deployment, line-of-sight links, and flexible mobility. In this paper, we study a UAV-assisted vehicular network where the UAV jointly adjusts its transmission control (power and channel) and 3D flight to maximize the total throughput. First, we formulate a Markov decision process (MDP) problem by modeling the mobility of the UAV/vehicles and the state transitions. Secondly, we solve the target problem using a deep reinforcement learning method, namely, the deep deterministic policy gradient (DDPG), and propose three solutions with different control objectives. Deep reinforcement learning methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Vehicular Ad Hoc Networks (VANETs) · Distributed Control Multi-Agent Systems
MethodsExperience Replay · Dense Connections · Weight Decay · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Convolution · Batch Normalization · Deep Deterministic Policy Gradient
