Online Planning for Multi-UAV Pursuit-Evasion in Unknown Environments Using Deep Reinforcement Learning

Jiayu Chen; Chao Yu; Guosheng Li; Wenhao Tang; Shilong Ji; Xinyi Yang; Botian Xu; Huazhong Yang; Yu Wang

arXiv:2409.15866·cs.RO·July 9, 2025

Online Planning for Multi-UAV Pursuit-Evasion in Unknown Environments Using Deep Reinforcement Learning

Jiayu Chen, Chao Yu, Guosheng Li, Wenhao Tang, Shilong Ji, Xinyi Yang, Botian Xu, Huazhong Yang, Yu Wang

PDF

Open Access

TL;DR

This paper presents a deep reinforcement learning approach for multi-UAV pursuit-evasion in unknown environments, incorporating UAV dynamics, partial observability, and adaptive training to achieve high success rates and real-world deployment.

Contribution

It introduces an evader prediction network, adaptive environment generation, and demonstrates zero-shot deployment on real quadrotors, advancing RL application in complex UAV scenarios.

Findings

01

Achieves 100% capture rate in simulations.

02

Outperforms baseline methods in diverse scenarios.

03

Successfully deploys policies on real UAVs without retraining.

Abstract

Multi-UAV pursuit-evasion, where pursuers aim to capture evaders, poses a key challenge for UAV swarm intelligence. Multi-agent reinforcement learning (MARL) has demonstrated potential in modeling cooperative behaviors, but most RL-based approaches remain constrained to simplified simulations with limited dynamics or fixed scenarios. Previous attempts to deploy RL policy to real-world pursuit-evasion are largely restricted to two-dimensional scenarios, such as ground vehicles or UAVs at fixed altitudes. In this paper, we address multi-UAV pursuit-evasion by considering UAV dynamics and physical constraints. We introduce an evader prediction-enhanced network to tackle partial observability in cooperative strategy learning. Additionally, we propose an adaptive environment generator within MARL training, enabling higher exploration efficiency and better policy generalization across diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Robotic Path Planning Algorithms · Military Defense Systems Analysis