Analysis of Reinforcement Learning Schemes for Trajectory Optimization   of an Aerial Radio Unit

Hossein Mohammadi; Vuk Marojevic; Bodong Shang

arXiv:2211.10524·eess.SY·November 22, 2022

Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit

Hossein Mohammadi, Vuk Marojevic, Bodong Shang

PDF

Open Access

TL;DR

This paper explores AI-driven trajectory optimization for UAV-based aerial radio units in wireless networks, comparing Q-learning and SARSA algorithms for maximizing network throughput in different environment sizes.

Contribution

It introduces a novel analysis of UAV trajectory optimization using reinforcement learning in the context of aerial radio units for wireless networks.

Findings

01

MDP-based trajectory planning outperforms in small environments

02

SARSA achieves better results in larger environments

03

Reinforcement learning effectively optimizes UAV paths for network throughput

Abstract

This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closest tower. In this paper we analyze the UAV trajectory in terms of artificial intelligence (AI) when it serves both UEs and central units (CUs) at the same time in multi input multi output (MIMO) fading channel. We first demonstrate the nonconvexity of the problem of maximizing the overall network throughput based on UAV location, and then we use two different machine learning approaches to solve it. We first assume that the environment is a gridworld and then let the UAV explore the environment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsUAV Applications and Optimization · Satellite Communication Systems · Advanced MIMO Systems Optimization

MethodsSarsa · Q-Learning