Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou, Haotian Zhang, Guodong Sun

TL;DR
This paper introduces HaDMC, a hybrid-action deep reinforcement learning framework for optimizing the route and charging schedule of drones and mobile chargers, improving operational efficiency in drone observation tasks.
Contribution
The paper proposes a novel hybrid-action DRL method with an action decoder for cooperative drone-charger scheduling, addressing discrete-continuous action challenges.
Findings
HaDMC outperforms existing DRL methods in efficiency and effectiveness.
The approach enables better cooperation between drone and charger agents.
Numerical experiments validate the proposed framework's superiority.
Abstract
Recently there has been a growing interest in industry and academia, regarding the use of wireless chargers to prolong the operational longevity of unmanned aerial vehicles (commonly knowns as drones). In this paper we consider a charger-assisted drone application: a drone is deployed to observe a set points of interest, while a charger can move to recharge the drone's battery. We focus on the route and charging schedule of the drone and the mobile charger, to obtain high observation utility with the shortest possible time, while ensuring the drone remains operational during task execution. Essentially, this proposed drone-charger scheduling problem is a multi-stage decision-making process, in which the drone and the mobile charger act as two agents who cooperate to finish a task. The discrete-continuous hybrid action space of the two agents poses a significant challenge in our problem.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Reinforcement Learning in Robotics
MethodsSparse Evolutionary Training · Focus
