UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement   Learning Approach

Harald Bayerlein; Mirco Theile; Marco Caccamo; David Gesbert

arXiv:2007.00544·cs.LG·January 28, 2021

UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

Harald Bayerlein, Mirco Theile, Marco Caccamo, David Gesbert

PDF

3 Repos

TL;DR

This paper introduces a deep reinforcement learning method for UAV path planning that efficiently adapts to changing scenarios in urban IoT data collection, balancing data gathering, safety, and flight time.

Contribution

It presents a novel DDQN-based approach with environment maps for generalized UAV control in dynamic urban environments, outperforming previous methods.

Findings

01

The proposed method effectively generalizes over different scenario parameters.

02

Using a centered map improves learning efficiency.

03

The approach balances data collection with safety and flight constraints.

Abstract

Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RL) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subject to limited flying time and obstacle avoidance. While previous approaches, learning and non-learning based, must perform expensive recomputations or relearn a behavior when important scenario parameters such as the number of sensors, sensor positions, or maximum flying time, change, we train a double deep Q-network (DDQN) with combined experience replay to learn a UAV control policy that generalizes over changing scenario parameters. By exploiting a multi-layer map of the environment fed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsExperience Replay