Mean Field Game and Decentralized Intelligent Adaptive Pursuit Evasion Strategy for Massive Multi-Agent System under Uncertain Environment
Zejian Zhou, Hao Xu

TL;DR
This paper introduces a decentralized adaptive pursuit-evasion strategy for large multi-agent systems using mean field game theory combined with reinforcement learning, effectively handling uncertainty and scalability issues.
Contribution
It develops a novel Actor-Critic-Mass neural network architecture integrating mean field game theory with reinforcement learning for large-scale multi-agent pursuit-evasion.
Findings
Strategy effectively handles massive agent populations.
Neural networks accurately approximate mean field and optimal policies.
Numerical simulations demonstrate high effectiveness and adaptability.
Abstract
In this paper, a novel decentralized intelligent adaptive optimal strategy has been developed to solve the pursuit-evasion game for massive Multi-Agent Systems (MAS) under uncertain environment. Existing strategies for pursuit-evasion games are neither efficient nor practical for large population multi-agent system due to the notorious "Curse of dimensionality" and communication limit while the agent population is large. To overcome these challenges, the emerging mean field game theory is adopted and further integrated with reinforcement learning to develop a novel decentralized intelligent adaptive strategy with a new type of adaptive dynamic programing architecture named the Actor-Critic-Mass (ACM). Through online approximating the solution of the coupled mean field equations, the developed strategy can obtain the optimal pursuit-evasion policy even for massive MAS under uncertain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGuidance and Control Systems · Adaptive Dynamic Programming Control · Mathematical and Theoretical Epidemiology and Ecology Models
