MAPEL: Multi-Agent Pursuer-Evader Learning using Situation Report

Sagar Verma; Richa Verma; P.B. Sujit

arXiv:1910.07780·cs.AI·October 18, 2019

MAPEL: Multi-Agent Pursuer-Evader Learning using Situation Report

Sagar Verma, Richa Verma, P.B. Sujit

PDF

Open Access 1 Repo

TL;DR

This paper introduces MAPEL, a decentralized multi-agent learning framework for pursuit-evasion games with obstacles, using situation reports and RNNs to enable effective cooperation among agents.

Contribution

The paper proposes a novel decentralized learning approach using situation reports and spatio-temporal graphs for multi-agent pursuit-evasion tasks.

Findings

01

MAPEL effectively enables agent cooperation in complex environments.

02

Peer-to-Peer and Ring Situation Report methods improve coordination.

03

Performance scales with the number of agents in the game.

Abstract

In this paper, we consider a territory guarding game involving pursuers, evaders and a target in an environment that contains obstacles. The goal of the evaders is to capture the target, while that of the pursuers is to capture the evaders before they reach the target. All the agents have limited sensing range and can only detect each other when they are in their observation space. We focus on the challenge of effective cooperation between agents of a team. Finding exact solutions for such multi-agent systems is difficult because of the inherent complexity. We present Multi-Agent Pursuer-Evader Learning (MAPEL), a class of algorithms that use spatio-temporal graph representation to learn structured cooperation. The key concept is that the learning takes place in a decentralized manner and agents use situation report updates to learn about the whole environment from each others' partial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sagarverma/MAPEL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Guidance and Control Systems · Adversarial Robustness in Machine Learning