DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV
Xiao Mao, Zhiguang Cao, Mingfeng Fan, Guohua Wu, and Witold Pedrycz

TL;DR
This paper introduces DL-DRL, a double-level deep reinforcement learning framework that efficiently solves large-scale multi-UAV task scheduling by decomposing the problem into task allocation and route planning, outperforming traditional methods.
Contribution
The paper proposes a novel double-level DRL approach with an interactive training strategy, enabling scalable and efficient multi-UAV task scheduling for large problem sizes.
Findings
DL-DRL outperforms traditional heuristics and learning-based methods in solution quality.
The approach maintains high performance on problems with up to 1000 tasks.
The interactive training strategy improves training efficiency and solution balance.
Abstract
Exploiting unmanned aerial vehicles (UAVs) to execute tasks is gaining growing popularity recently. To solve the underlying task scheduling problem, the deep reinforcement learning (DRL) based methods demonstrate notable advantage over the conventional heuristics as they rely less on hand-engineered rules. However, their decision space will become prohibitively huge as the problem scales up, thus deteriorating the computation efficiency. To alleviate this issue, we propose a double-level deep reinforcement learning (DL-DRL) approach based on a divide and conquer framework (DCF), where we decompose the task scheduling of multi-UAV into task allocation and route planning. Particularly, we design an encoder-decoder structured policy network in our upper-level DRL model to allocate the tasks to different UAVs, and we exploit another attention based policy network in our lower-level DRL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
MethodsSelf-Learning
