Meta-Reinforcement Learning for Trajectory Design in Wireless UAV Networks
Ye Hu, Mingzhe Chen, Walid Saad, H. Vincent Poor, and Shuguang Cui

TL;DR
This paper introduces a meta-reinforcement learning approach to optimize drone trajectories in dynamic wireless networks, enabling quick adaptation and improved performance in unseen environments.
Contribution
It proposes a meta-learning algorithm that enhances reinforcement learning for drone trajectory design, achieving faster convergence and better adaptability in unpredictable network conditions.
Findings
25% faster convergence in training
10% improvement in communication performance
27% increase in serving over 50% user requests
Abstract
In this paper, the design of an optimal trajectory for an energy-constrained drone operating in dynamic network environments is studied. In the considered model, a drone base station (DBS) is dispatched to provide uplink connectivity to ground users whose demand is dynamic and unpredictable. In this case, the DBS's trajectory must be adaptively adjusted to satisfy the dynamic user access requests. To this end, a meta-learning algorithm is proposed in order to adapt the DBS's trajectory when it encounters novel environments, by tuning a reinforcement learning (RL) solution. The meta-learning algorithm provides a solution that adapts the DBS in novel environments quickly based on limited former experiences. The meta-tuned RL is shown to yield a faster convergence to the optimal coverage in unseen environments with a considerably low computation complexity, compared to the baseline policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Energy Harvesting in Wireless Networks · Advanced Wireless Communication Technologies
