Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game

Xiang Yuming; Li Sizhao; Li Rongpeng; Zhao Zhifeng; Zhang Honggang

arXiv:2506.18126·cs.AI·June 24, 2025

Decentralized Consensus Inference-based Hierarchical Reinforcement Learning for Multi-Constrained UAV Pursuit-Evasion Game

Xiang Yuming, Li Sizhao, Li Rongpeng, Zhao Zhifeng, Zhang Honggang

PDF

TL;DR

This paper introduces a hierarchical reinforcement learning framework for multi-UAV pursuit-evasion games, enabling decentralized consensus and improved collaboration under communication constraints.

Contribution

It proposes a novel two-level CI-HRL framework with a consensus-oriented multi-agent communication module and an alternative training-based policy optimization method.

Findings

01

Enhanced swarm collaboration and evasion capabilities in simulations

02

Effective consensus formation among UAV agents

03

Superior performance compared to existing methods

Abstract

Multiple quadrotor unmanned aerial vehicle (UAV) systems have garnered widespread research interest and fostered tremendous interesting applications, especially in multi-constrained pursuit-evasion games (MC-PEG). The Cooperative Evasion and Formation Coverage (CEFC) task, where the UAV swarm aims to maximize formation coverage across multiple target zones while collaboratively evading predators, belongs to one of the most challenging issues in MC-PEG, especially under communication-limited constraints. This multifaceted problem, which intertwines responses to obstacles, adversaries, target zones, and formation dynamics, brings up significant high-dimensional complications in locating a solution. In this paper, we propose a novel two-level framework (i.e., Consensus Inference-based Hierarchical Reinforcement Learning (CI-HRL)), which delegates target localization to a high-level policy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.