MatrixWorld: A pursuit-evasion platform for safe multi-agent coordination and autocurricula
Lijun Sun, Yu-Cheng Chang, Chao Lyu, Chin-Teng Lin, and Yuhui Shi

TL;DR
MatrixWorld is a new multi-agent environment designed to enhance safety in reinforcement learning tasks, supporting pursuit-evasion scenarios with safety constraints, conflict resolution, and serving as a platform for autocurricula research.
Contribution
This work introduces MatrixWorld, a safety-constrained pursuit-evasion platform with novel conflict resolution methods and a co-evolution framework for safe multi-agent learning and autocurricula studies.
Findings
Extended conflict resolution to heterogeneous/adversarial agents
Proposed three conflict resolution strategies
MatrixWorld supports diverse pursuit-evasion variants
Abstract
Multi-agent reinforcement learning (MARL) achieves encouraging performance in solving complex tasks. However, the safety of MARL policies is one critical concern that impedes their real-world applications. Popular multi-agent benchmarks focus on diverse tasks yet provide limited safety support. Therefore, this work proposes a safety-constrained multi-agent environment: MatrixWorld, based on the general pursuit-evasion game. Particularly, a safety-constrained multi-agent action execution model is proposed for the software implementation of safe multi-agent environments based on diverse safety definitions. It (1) extends the vertex conflict among homogeneous / cooperative agents to heterogeneous / adversarial settings, and (2) proposes three types of resolutions for each type of conflict, aiming at providing rational and unbiased feedback for safe MARL. Besides, MatrixWorld is also a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
