Enhancing UAV Path Planning Efficiency Through Accelerated Learning

Joseanne Viana; Boris Galkin; Lester Ho; Holger Claussen

arXiv:2501.10141·cs.LG·January 20, 2025

Enhancing UAV Path Planning Efficiency Through Accelerated Learning

Joseanne Viana, Boris Galkin, Lester Ho, Holger Claussen

PDF

Open Access

TL;DR

This paper introduces a novel learning algorithm that significantly accelerates UAV path planning convergence by integrating dimensionality reduction, experience replay, and combined loss functions into a DRL framework.

Contribution

It presents a new approach that reduces training time and storage requirements for UAV path planning using advanced DRL techniques and data processing methods.

Findings

01

Training episodes reduced by approximately four times.

02

Enhanced convergence speed in UAV path planning algorithms.

03

Improved efficiency with lower memory and storage demands.

Abstract

Unmanned Aerial Vehicles (UAVs) are increasingly essential in various fields such as surveillance, reconnaissance, and telecommunications. This study aims to develop a learning algorithm for the path planning of UAV wireless communication relays, which can reduce storage requirements and accelerate Deep Reinforcement Learning (DRL) convergence. Assuming the system possesses terrain maps of the area and can estimate user locations using localization algorithms or direct GPS reporting, it can input these parameters into the learning algorithms to achieve optimized path planning performance. However, higher resolution terrain maps are necessary to extract topological information such as terrain height, object distances, and signal blockages. This requirement increases memory and storage demands on UAVs while also lengthening convergence times in DRL algorithms. Similarly, defining the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Control and Dynamics of Mobile Robots

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Adam · Clipped Double Q-learning · Target Policy Smoothing · Experience Replay · Dense Connections · Twin Delayed Deep Deterministic · Greedy Policy Search · Prioritized Experience Replay