MatrixWorld: A pursuit-evasion platform for safe multi-agent   coordination and autocurricula

Lijun Sun; Yu-Cheng Chang; Chao Lyu; Chin-Teng Lin; and Yuhui Shi

arXiv:2307.14854·cs.MA·June 6, 2024

MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula

Lijun Sun, Yu-Cheng Chang, Chao Lyu, Chin-Teng Lin, and Yuhui Shi

PDF

Open Access 1 Repo

TL;DR

MatrixWorld is a new multi-agent environment designed to enhance safety in reinforcement learning tasks, supporting pursuit-evasion scenarios with safety constraints, conflict resolution, and serving as a platform for autocurricula research.

Contribution

This work introduces MatrixWorld, a safety-constrained pursuit-evasion platform with novel conflict resolution methods and a co-evolution framework for safe multi-agent learning and autocurricula studies.

Findings

01

Extended conflict resolution to heterogeneous/adversarial agents

02

Proposed three conflict resolution strategies

03

MatrixWorld supports diverse pursuit-evasion variants

Abstract

Multi-agent reinforcement learning (MARL) achieves encouraging performance in solving complex tasks. However, the safety of MARL policies is one critical concern that impedes their real-world applications. Popular multi-agent benchmarks focus on diverse tasks yet provide limited safety support. Therefore, this work proposes a safety-constrained multi-agent environment: MatrixWorld, based on the general pursuit-evasion game. Particularly, a safety-constrained multi-agent action execution model is proposed for the software implementation of safe multi-agent environments based on diverse safety definitions. It (1) extends the vertex conflict among homogeneous / cooperative agents to heterogeneous / adversarial settings, and (2) proposes three types of resolutions for each type of conflict, aiming at providing rational and unbiased feedback for safe MARL. Besides, MatrixWorld is also a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lijunsun90/matrixworld
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning