Path Design for Cellular-Connected UAV with Reinforcement Learning
Yong Zeng, Xiaoli Xu

TL;DR
This paper introduces a reinforcement learning-based method for designing UAV paths that optimize mission time and connectivity, effectively avoiding coverage gaps in complex urban environments.
Contribution
It proposes a novel RL algorithm using temporal-difference learning and tile coding for efficient UAV path planning in cellular networks.
Findings
Successfully avoids coverage holes in urban environments
Handles large state spaces with linear function approximation
Suitable for online and offline path planning
Abstract
This paper studies the path design problem for cellular-connected unmanned aerial vehicle (UAV), which aims to minimize its mission completion time while maintaining good connectivity with the cellular network. We first argue that the conventional path design approach via formulating and solving optimization problems faces several practical challenges, and then propose a new reinforcement learning-based UAV path design algorithm by applying \emph{temporal-difference} method to directly learn the \emph{state-value function} of the corresponding Markov Decision Process. The proposed algorithm is further extended by using linear function approximation with tile coding to deal with large state space. The proposed algorithms only require the raw measured or simulation-generated signal strength as the input and are suitable for both online and offline implementations. Numerical results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Smart Parking Systems Research · Distributed Control Multi-Agent Systems
