Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach
Yang Wang, Zhen Gao

TL;DR
This paper introduces a deep reinforcement learning method for optimizing the 3D trajectory of UAVs in multi-user MISO communication systems to minimize data transmission time in urban environments.
Contribution
It develops a novel DRL-based trajectory design approach using a deep deterministic policy gradient algorithm with pheromone-based state representation for urban UAV communications.
Findings
DRL-TDCTM outperforms baseline methods in simulations.
The approach effectively adapts UAV trajectories in complex urban environments.
Simulation results demonstrate reduced transmission completion time.
Abstract
In this paper, we investigate a multi-user downlink multiple-input single-output (MISO) unmanned aerial vehicle (UAV) communication system, where a multi-antenna UAV is employed to serve multiple ground terminals. Unlike existing approaches focus only on a simplified two-dimensional scenario, this paper considers a three-dimensional (3D) urban environment, where the UAV's 3D trajectory is designed to minimize data transmission completion time subject to practical throughput and flight movement constraints. Specifically, we propose a deep reinforcement learning (DRL)-based trajectory design for completion time minimization (DRL-TDCTM), which is developed from a deep deterministic policy gradient algorithm. In particular, to represent the state information of UAV and environment, we set an additional information, i.e., the merged pheromone, as a reference of reward which facilitates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Smart Parking Systems Research · Distributed Control Multi-Agent Systems
