Path Design for Cellular-Connected UAV with Reinforcement Learning

Yong Zeng; Xiaoli Xu

arXiv:1905.03440 · cs.NI · May 10, 2019 · 6 cites

TL;DR

This paper introduces a reinforcement learning-based method for designing the path of a cellular-connected UAV that minimizes mission completion time while maintaining good connectivity with the cellular network, steering the UAV around coverage holes in complex urban environments.

Contribution

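It proposes a reinforcement learning algorithm that uses temporal-difference learning to learn the state-value function of the underlying Markov decision process directly, and extends it with linear function approximation and tile coding to handle large state spaces.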

Findings

01. Successfully avoids coverage holes in urban environments

02. Handles large state spaces with linear function approximation

03. Suitable for online and offline path planning

Abstract

This paper studies the path design problem for a cellular-connected unmanned aerial vehicle (UAV), which aims to minimize its mission completion time while maintaining good connectivity with the cellular network. We first argue that the conventional approach of formulating and solving optimization problems faces several practical challenges, and then propose a new reinforcement learning-based UAV path design algorithm that applies the temporal-difference method to directly learn the state-value function of the corresponding Markov decision process. The proposed algorithm is further extended with linear function approximation and tile coding to deal with large state spaces. The proposed algorithms require only the raw measured or simulation-generated signal strength as input and are suitable for both online and offline implementations. Numerical results show…
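To make the two ingredients named in the abstract concrete, the following is a minimal sketch of semi-gradient TD(0) learning a state-value function through a tile-coded linear approximator. It is an illustration under stated assumptions, not the authors' algorithm: the unit-square flight area, the destination `GOAL`, the circular coverage hole, the reward values, the step size, and the fixed fly-straight-to-destination policy are all hypothetical choices made here so that the example is self-contained.

```python
import math
import random

GOAL = (0.9, 0.9)                             # hypothetical destination
HOLE_CENTER, HOLE_RADIUS = (0.5, 0.5), 0.15   # hypothetical coverage hole
STEP = 0.1                                    # hypothetical distance flown per time step


class TileCoder:
    """Several offset uniform grids (tilings) over the unit square."""

    def __init__(self, num_tilings=4, tiles_per_dim=8):
        self.num_tilings = num_tilings
        self.tiles_per_dim = tiles_per_dim
        # One extra tile per dimension absorbs the shift of the offset tilings.
        self.tiles_per_tiling = (tiles_per_dim + 1) ** 2
        self.num_features = num_tilings * self.tiles_per_tiling

    def active_tiles(self, x, y):
        """Return one active feature index per tiling for position (x, y)."""
        indices = []
        for t in range(self.num_tilings):
            offset = t / (self.num_tilings * self.tiles_per_dim)
            col = int((x + offset) * self.tiles_per_dim)
            row = int((y + offset) * self.tiles_per_dim)
            base = t * self.tiles_per_tiling
            indices.append(base + row * (self.tiles_per_dim + 1) + col)
        return indices


def value(weights, coder, pos):
    """Linear value estimate: sum of the weights of the active tiles."""
    return sum(weights[i] for i in coder.active_tiles(*pos))


def td0_update(weights, coder, pos, reward, next_pos, terminal,
               alpha=0.1, gamma=1.0):
    """One semi-gradient TD(0) update; returns the TD error."""
    bootstrap = 0.0 if terminal else gamma * value(weights, coder, next_pos)
    delta = reward + bootstrap - value(weights, coder, pos)
    step_size = alpha / coder.num_tilings
    for i in coder.active_tiles(*pos):
        weights[i] += step_size * delta
    return delta


def step_toward_goal(pos):
    """Fixed policy for this sketch: fly straight toward GOAL."""
    dx, dy = GOAL[0] - pos[0], GOAL[1] - pos[1]
    dist = math.hypot(dx, dy)
    if dist <= STEP:
        return GOAL, True
    return (pos[0] + STEP * dx / dist, pos[1] + STEP * dy / dist), False


def reward_at(pos):
    """-1 per time step, plus an extra penalty inside the coverage hole."""
    in_hole = math.hypot(pos[0] - HOLE_CENTER[0],
                         pos[1] - HOLE_CENTER[1]) < HOLE_RADIUS
    return -1.0 - (5.0 if in_hole else 0.0)


def train(episodes=2000, seed=0):
    """Evaluate the fixed policy from random start positions."""
    rng = random.Random(seed)
    coder = TileCoder()
    weights = [0.0] * coder.num_features
    for _ in range(episodes):
        pos, done = (rng.random(), rng.random()), False
        while not done:
            next_pos, done = step_toward_goal(pos)
            td0_update(weights, coder, pos, reward_at(next_pos), next_pos, done)
            pos = next_pos
    return coder, weights


if __name__ == "__main__":
    coder, weights = train()
    for start in [(0.85, 0.85), (0.05, 0.9), (0.1, 0.1)]:
        print(start, round(value(weights, coder, start), 2))
```

Because the weight vector has a fixed length (`num_features`) regardless of how finely positions are resolved, the memory cost does not grow with the number of distinct states, which is the motivation for function approximation over a lookup table. The sketch only evaluates one fixed policy; turning the learned values into a path would additionally require choosing, at each step, the move whose resulting state has the highest estimated value.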

Taxonomy

Topics: UAV Applications and Optimization · Smart Parking Systems Research · Distributed Control Multi-Agent Systems