Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted   Vehicular Networks

Ming Zhu; Xiao-Yang Liu; and Anwar Walid

arXiv:1906.05015·cs.LG·February 22, 2023·22 cites

Deep Reinforcement Learning for Unmanned Aerial Vehicle-Assisted Vehicular Networks

Ming Zhu, Xiao-Yang Liu, and Anwar Walid

PDF

Open Access

TL;DR

This paper applies deep reinforcement learning, specifically DDPG, to optimize UAV-assisted vehicular networks by jointly controlling UAV flight and communication to maximize throughput and energy efficiency.

Contribution

It introduces a novel DDPG-based approach for joint UAV control and communication optimization in vehicular networks, considering energy constraints and mobility prediction.

Findings

01

Proposed algorithms outperform baseline schemes in throughput.

02

Modified DDPG framework adapts to UAV mobility and energy considerations.

03

Verification confirms near-optimal performance in simplified models.

Abstract

Unmanned aerial vehicles (UAVs) are envisioned to complement the 5G communication infrastructure in future smart cities. Hot spots easily appear in road intersections, where effective communication among vehicles is challenging. UAVs may serve as relays with the advantages of low price, easy deployment, line-of-sight links, and flexible mobility. In this paper, we study a UAV-assisted vehicular network where the UAV jointly adjusts its transmission control (power and channel) and 3D flight to maximize the total throughput. First, we formulate a Markov decision process (MDP) problem by modeling the mobility of the UAV/vehicles and the state transitions. Secondly, we solve the target problem using a deep reinforcement learning method, namely, the deep deterministic policy gradient (DDPG), and propose three solutions with different control objectives. Deep reinforcement learning methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Vehicular Ad Hoc Networks (VANETs) · Distributed Control Multi-Agent Systems

MethodsExperience Replay · Dense Connections · Weight Decay · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Convolution · Batch Normalization · Deep Deterministic Policy Gradient