Reinforcement Learning for Decentralized Trajectory Design in Cellular   UAV Networks with Sense-and-Send Protocol

Jingzhi Hu; Hongliang Zhang; and Lingyang Song

arXiv:1809.02934·eess.SP·September 11, 2018·IEEE Internet Things J.

Reinforcement Learning for Decentralized Trajectory Design in Cellular UAV Networks with Sense-and-Send Protocol

Jingzhi Hu, Hongliang Zhang, and Lingyang Song

PDF

Open Access

TL;DR

This paper introduces a decentralized reinforcement learning approach for UAVs to optimize their trajectories in real-time sensing tasks over cellular networks, improving efficiency and data transmission success.

Contribution

It proposes a novel sense-and-send protocol and an enhanced multi-UAV Q-learning algorithm for decentralized trajectory design in dynamic environments.

Findings

01

Faster convergence of the proposed algorithm.

02

Higher utility achieved in real-time sensing scenarios.

03

Effective coordination among multiple UAVs.

Abstract

Recently, the unmanned aerial vehicles (UAVs) have been widely used in real-time sensing applications over cellular networks, which sense the conditions of the tasks and transmit the real-time sensory data to the base station (BS). The performance of a UAV is determined by the performance of both its sensing and transmission processes, which are influenced by the trajectory of the UAV. However, it is challenging for UAVs to design their trajectories efficiently, since they work in a dynamic environment. To tackle this challenge, in this paper, we adopt the reinforcement learning framework to solve the UAV trajectory design problem in a decentralized manner. To coordinate multiple UAVs performing the real-time sensing tasks, we first propose a sense-and-send protocol, and analyze the probability for successful valid data transmission using nested Markov chains. Then, we formulate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Distributed Control Multi-Agent Systems · Energy Harvesting in Wireless Networks