Three-Dimensional Trajectory Design for Multi-User MISO UAV   Communications: A Deep Reinforcement Learning Approach

Yang Wang; Zhen Gao

arXiv:2108.00668·cs.IT·August 3, 2021

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach

Yang Wang, Zhen Gao

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning method for optimizing the 3D trajectory of UAVs in multi-user MISO communication systems to minimize data transmission time in urban environments.

Contribution

It develops a novel DRL-based trajectory design approach using a deep deterministic policy gradient algorithm with pheromone-based state representation for urban UAV communications.

Findings

01

DRL-TDCTM outperforms baseline methods in simulations.

02

The approach effectively adapts UAV trajectories in complex urban environments.

03

Simulation results demonstrate reduced transmission completion time.

Abstract

In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL-TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e., the merged pheromone, as a reference of reward which facilitates the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Smart Parking Systems Research · Distributed Control Multi-Agent Systems