Hierarchical Reinforcement Learning for Swarm Confrontation with High   Uncertainty

Qizhen Wu; Kexin Liu; Lei Chen; Jinhu L\"u

arXiv:2406.07877·cs.RO·October 28, 2024

Hierarchical Reinforcement Learning for Swarm Confrontation with High Uncertainty

Qizhen Wu, Kexin Liu, Lei Chen, Jinhu L\"u

PDF

Open Access 1 Repo

TL;DR

This paper introduces a hierarchical reinforcement learning framework for swarm confrontation tasks under high uncertainty, effectively decoupling decision processes and improving training stability and performance in dynamic, multi-agent environments.

Contribution

It proposes a novel hierarchical RL architecture with uncertainty quantification and an integrated training method, advancing swarm confrontation strategies under uncertain conditions.

Findings

01

Achieves around 90% win rate in multi-agent confrontation scenarios.

02

Outperforms traditional methods in effectiveness and generalization.

03

Demonstrates robustness and stability in real-robot experiments.

Abstract

In swarm robotics, confrontation including the pursuit-evasion game is a key scenario. High uncertainty caused by unknown opponents' strategies, dynamic obstacles, and insufficient training complicates the action space into a hybrid decision process. Although the deep reinforcement learning method is significant for swarm confrontation since it can handle various sizes, as an end-to-end implementation, it cannot deal with the hybrid process. Here, we propose a novel hierarchical reinforcement learning approach consisting of a target allocation layer, a path planning layer, and the underlying dynamic interaction mechanism between the two layers, which indicates the quantified uncertainty. It decouples the hybrid process into discrete allocation and continuous planning layers, with a probabilistic ensemble model to quantify the uncertainty and regulate the interaction frequency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Wu-duanduan/Swarm_confrontation_HRL
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOpinion Dynamics and Social Influence · Distributed Control Multi-Agent Systems · Mathematical and Theoretical Epidemiology and Ecology Models