Embedded Mean Field Reinforcement Learning for Perimeter-defense Game

Li Wang; Xin Yu; Xuxin Lv; Gangzheng Ai; Wenjun Wu

arXiv:2505.14209·cs.AI·May 21, 2025

Embedded Mean Field Reinforcement Learning for Perimeter-defense Game

Li Wang, Xin Yu, Xuxin Lv, Gangzheng Ai, Wenjun Wu

PDF

Open Access

TL;DR

This paper develops a scalable reinforcement learning framework, EMFAC, for large-scale, realistic perimeter-defense scenarios involving heterogeneous UAVs, motion dynamics, and environmental factors, validated through extensive simulations and real-world tests.

Contribution

It introduces EMFAC, a novel mean-field reinforcement learning approach with attention mechanisms, tailored for complex, large-scale perimeter-defense tasks with heterogeneous agents and realistic dynamics.

Findings

01

EMFAC outperforms baseline methods in convergence speed and performance.

02

The framework effectively handles large-scale heterogeneous control challenges.

03

Real-world experiments confirm practical applicability and robustness.

Abstract

With the rapid advancement of unmanned aerial vehicles (UAVs) and missile technologies, perimeter-defense game between attackers and defenders for the protection of critical regions have become increasingly complex and strategically significant across a wide range of domains. However, existing studies predominantly focus on small-scale, simplified two-dimensional scenarios, often overlooking realistic environmental perturbations, motion dynamics, and inherent heterogeneity--factors that pose substantial challenges to real-world applicability. To bridge this gap, we investigate large-scale heterogeneous perimeter-defense game in a three-dimensional setting, incorporating realistic elements such as motion dynamics and wind fields. We derive the Nash equilibrium strategies for both attackers and defenders, characterize the victory regions, and validate our theoretical findings through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics

MethodsSoftmax · Attention Is All You Need · Focus · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings