Embedded Mean Field Reinforcement Learning for Perimeter-defense Game
Li Wang, Xin Yu, Xuxin Lv, Gangzheng Ai, Wenjun Wu

TL;DR
This paper develops a scalable reinforcement learning framework, EMFAC, for large-scale, realistic perimeter-defense scenarios involving heterogeneous UAVs, motion dynamics, and environmental factors, validated through extensive simulations and real-world tests.
Contribution
It introduces EMFAC, a novel mean-field reinforcement learning approach with attention mechanisms, tailored for complex, large-scale perimeter-defense tasks with heterogeneous agents and realistic dynamics.
Findings
EMFAC outperforms baseline methods in convergence speed and performance.
The framework effectively handles large-scale heterogeneous control challenges.
Real-world experiments confirm practical applicability and robustness.
Abstract
With the rapid advancement of unmanned aerial vehicles (UAVs) and missile technologies, perimeter-defense game between attackers and defenders for the protection of critical regions have become increasingly complex and strategically significant across a wide range of domains. However, existing studies predominantly focus on small-scale, simplified two-dimensional scenarios, often overlooking realistic environmental perturbations, motion dynamics, and inherent heterogeneity--factors that pose substantial challenges to real-world applicability. To bridge this gap, we investigate large-scale heterogeneous perimeter-defense game in a three-dimensional setting, incorporating realistic elements such as motion dynamics and wind fields. We derive the Nash equilibrium strategies for both attackers and defenders, characterize the victory regions, and validate our theoretical findings through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics
MethodsSoftmax · Attention Is All You Need · Focus · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
