Continuous Transfer Learning for UAV Communication-aware Trajectory Design
Chenrui Sun, Gianluca Fontanesi, Swarna Bindu Chetty, Xuanyu, Liang, Berk Canberk, Hamed Ahmadi

TL;DR
This paper introduces a continuous transfer learning approach using a double deep Q network to efficiently adapt UAV trajectories to new environments, significantly reducing retraining time and improving learning speed.
Contribution
It proposes a novel CTL method that transfers learned weights and adapts learning parameters for UAV trajectory planning across diverse urban environments.
Findings
CTL accelerates convergence by 65% in similar environments
It enables 35% faster adaptation in dissimilar environments
The approach improves success rates over models trained from scratch
Abstract
Deep Reinforcement Learning (DRL) emerges as a prime solution for Unmanned Aerial Vehicle (UAV) trajectory planning, offering proficiency in navigating high-dimensional spaces, adaptability to dynamic environments, and making sequential decisions based on real-time feedback. Despite these advantages, the use of DRL for UAV trajectory planning requires significant retraining when the UAV is confronted with a new environment, resulting in wasted resources and time. Therefore, it is essential to develop techniques that can reduce the overhead of retraining DRL models, enabling them to adapt to constantly changing environments. This paper presents a novel method to reduce the need for extensive retraining using a double deep Q network (DDQN) model as a pretrained base, which is subsequently adapted to different urban environments through Continuous Transfer Learning (CTL). Our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Machine Learning and ELM · Robotics and Sensor-Based Localization
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
