MAPEL: Multi-Agent Pursuer-Evader Learning using Situation Report
Sagar Verma, Richa Verma, P.B. Sujit

TL;DR
This paper introduces MAPEL, a decentralized multi-agent learning framework for pursuit-evasion games with obstacles, using situation reports and RNNs to enable effective cooperation among agents.
Contribution
The paper proposes a novel decentralized learning approach using situation reports and spatio-temporal graphs for multi-agent pursuit-evasion tasks.
Findings
MAPEL effectively enables agent cooperation in complex environments.
Peer-to-Peer and Ring Situation Report methods improve coordination.
Performance scales with the number of agents in the game.
Abstract
In this paper, we consider a territory guarding game involving pursuers, evaders and a target in an environment that contains obstacles. The goal of the evaders is to capture the target, while that of the pursuers is to capture the evaders before they reach the target. All the agents have limited sensing range and can only detect each other when they are in their observation space. We focus on the challenge of effective cooperation between agents of a team. Finding exact solutions for such multi-agent systems is difficult because of the inherent complexity. We present Multi-Agent Pursuer-Evader Learning (MAPEL), a class of algorithms that use spatio-temporal graph representation to learn structured cooperation. The key concept is that the learning takes place in a decentralized manner and agents use situation report updates to learn about the whole environment from each others' partial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Guidance and Control Systems · Adversarial Robustness in Machine Learning
