Reinforcement Learning-based Joint Path and Energy Optimization of Cellular-Connected Unmanned Aerial Vehicles
Arash Hooshmand

TL;DR
This paper presents a reinforcement learning-based method for optimizing the path and energy management of cellular-connected UAVs, enabling long-range missions with recharging stations while avoiding no-fly zones.
Contribution
It introduces a hierarchical reinforcement learning approach that extends short-range path planning to include energy recharge and no-fly zone constraints for UAVs.
Findings
Q-learning enables optimal path and recharge policies.
The method supports large-area UAV missions.
Simulation results demonstrate effective energy and path optimization.
Abstract
Unmanned Aerial Vehicles (UAVs) have attracted considerable research interest recently. Especially when it comes to the realm of Internet of Things, the UAVs with Internet connectivity are one of the main demands. Furthermore, the energy constraint i.e. battery limit is a bottle-neck of the UAVs that can limit their applications. We try to address and solve the energy problem. Therefore, a path planning method for a cellular-connected UAV is proposed that will enable the UAV to plan its path in an area much larger than its battery range by getting recharged in certain positions equipped with power stations (PSs). In addition to the energy constraint, there are also no-fly zones; for example, due to Air to Air (A2A) and Air to Ground (A2G) interference or for lack of necessary connectivity that impose extra constraints in the trajectory optimization of the UAV. No-fly zones determine the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Robotic Path Planning Algorithms · Distributed Control Multi-Agent Systems
MethodsQ-Learning
