Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit
Hossein Mohammadi, Vuk Marojevic, Bodong Shang

TL;DR
This paper explores AI-driven trajectory optimization for UAV-based aerial radio units in wireless networks, comparing Q-learning and SARSA algorithms for maximizing network throughput in different environment sizes.
Contribution
It introduces a novel analysis of UAV trajectory optimization using reinforcement learning in the context of aerial radio units for wireless networks.
Findings
MDP-based trajectory planning outperforms in small environments
SARSA achieves better results in larger environments
Reinforcement learning effectively optimizes UAV paths for network throughput
Abstract
This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closest tower. In this paper we analyze the UAV trajectory in terms of artificial intelligence (AI) when it serves both UEs and central units (CUs) at the same time in multi input multi output (MIMO) fading channel. We first demonstrate the nonconvexity of the problem of maximizing the overall network throughput based on UAV location, and then we use two different machine learning approaches to solve it. We first assume that the environment is a gridworld and then let the UAV explore the environment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Satellite Communication Systems · Advanced MIMO Systems Optimization
MethodsSarsa · Q-Learning
