Enhancing UAV Path Planning Efficiency Through Accelerated Learning
Joseanne Viana, Boris Galkin, Lester Ho, Holger Claussen

TL;DR
This paper introduces a novel learning algorithm that significantly accelerates UAV path planning convergence by integrating dimensionality reduction, experience replay, and combined loss functions into a DRL framework.
Contribution
It presents a new approach that reduces training time and storage requirements for UAV path planning using advanced DRL techniques and data processing methods.
Findings
Training episodes reduced by approximately four times.
Enhanced convergence speed in UAV path planning algorithms.
Improved efficiency with lower memory and storage demands.
Abstract
Unmanned Aerial Vehicles (UAVs) are increasingly essential in various fields such as surveillance, reconnaissance, and telecommunications. This study aims to develop a learning algorithm for the path planning of UAV wireless communication relays, which can reduce storage requirements and accelerate Deep Reinforcement Learning (DRL) convergence. Assuming the system possesses terrain maps of the area and can estimate user locations using localization algorithms or direct GPS reporting, it can input these parameters into the learning algorithms to achieve optimized path planning performance. However, higher resolution terrain maps are necessary to extract topological information such as terrain height, object distances, and signal blockages. This requirement increases memory and storage demands on UAVs while also lengthening convergence times in DRL algorithms. Similarly, defining the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Control and Dynamics of Mobile Robots
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Adam · Clipped Double Q-learning · Target Policy Smoothing · Experience Replay · Dense Connections · Twin Delayed Deep Deterministic · Greedy Policy Search · Prioritized Experience Replay
