Continuous Transfer Learning for UAV Communication-aware Trajectory   Design

Chenrui Sun; Gianluca Fontanesi; Swarna Bindu Chetty; Xuanyu; Liang; Berk Canberk; Hamed Ahmadi

arXiv:2405.10087·eess.SP·May 17, 2024·WINCOM

Continuous Transfer Learning for UAV Communication-aware Trajectory Design

Chenrui Sun, Gianluca Fontanesi, Swarna Bindu Chetty, Xuanyu, Liang, Berk Canberk, Hamed Ahmadi

PDF

Open Access

TL;DR

This paper introduces a continuous transfer learning approach using a double deep Q network to efficiently adapt UAV trajectories to new environments, significantly reducing retraining time and improving learning speed.

Contribution

It proposes a novel CTL method that transfers learned weights and adapts learning parameters for UAV trajectory planning across diverse urban environments.

Findings

01

CTL accelerates convergence by 65% in similar environments

02

It enables 35% faster adaptation in dissimilar environments

03

The approach improves success rates over models trained from scratch

Abstract

Deep Reinforcement Learning (DRL) emerges as a prime solution for Unmanned Aerial Vehicle (UAV) trajectory planning, offering proficiency in navigating high-dimensional spaces, adaptability to dynamic environments, and making sequential decisions based on real-time feedback. Despite these advantages, the use of DRL for UAV trajectory planning requires significant retraining when the UAV is confronted with a new environment, resulting in wasted resources and time. Therefore, it is essential to develop techniques that can reduce the overhead of retraining DRL models, enabling them to adapt to constantly changing environments. This paper presents a novel method to reduce the need for extensive retraining using a double deep Q network (DDQN) model as a pretrained base, which is subsequently adapted to different urban environments through Continuous Transfer Learning (CTL). Our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Machine Learning and ELM · Robotics and Sensor-Based Localization

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings