MALib: A Parallel Framework for Population-based Multi-agent   Reinforcement Learning

Ming Zhou; Ziyu Wan; Hanjing Wang; Muning Wen; Runzhe Wu; Ying Wen,; Yaodong Yang; Weinan Zhang; Jun Wang

arXiv:2106.07551·cs.MA·June 15, 2021·25 cites

MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning

Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen,, Yaodong Yang, Weinan Zhang, Jun Wang

PDF

Open Access 1 Repo

TL;DR

MALib is a scalable, efficient parallel framework designed for population-based multi-agent reinforcement learning, enabling high throughput and faster training across complex multi-agent tasks.

Contribution

The paper introduces MALib, a novel framework that addresses parallelization challenges in PB-MARL through centralized task dispatching, a new Actor-Evaluator-Learner architecture, and flexible training abstractions.

Findings

01

Achieves over 40K FPS on a single machine with 32 CPU cores.

02

Provides 5x speedup over RLlib in multi-agent training.

03

Offers at least 3x speedup compared to OpenSpiel.

Abstract

Population-based multi-agent reinforcement learning (PB-MARL) refers to the series of methods nested with reinforcement learning (RL) algorithms, which produces a self-generated sequence of tasks arising from the coupled population dynamics. By leveraging auto-curricula to induce a population of distinct emergent strategies, PB-MARL has achieved impressive success in tackling multi-agent tasks. Despite remarkable prior arts of distributed RL frameworks, PB-MARL poses new challenges for parallelizing the training frameworks due to the additional complexity of multiple nested workloads between sampling, training and evaluation involved with heterogeneous policy interactions. To solve these problems, we present MALib, a scalable and efficient computing framework for PB-MARL. Our framework is comprised of three key components: (1) a centralized task dispatching model, which supports the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sjtu-marl/malib
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Artificial Intelligence in Games