MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen,, Yaodong Yang, Weinan Zhang, Jun Wang

TL;DR
MALib is a scalable, efficient parallel framework designed for population-based multi-agent reinforcement learning, enabling high throughput and faster training across complex multi-agent tasks.
Contribution
The paper introduces MALib, a novel framework that addresses parallelization challenges in PB-MARL through centralized task dispatching, a new Actor-Evaluator-Learner architecture, and flexible training abstractions.
Findings
Achieves over 40K FPS on a single machine with 32 CPU cores.
Provides 5x speedup over RLlib in multi-agent training.
Offers at least 3x speedup compared to OpenSpiel.
Abstract
Population-based multi-agent reinforcement learning (PB-MARL) refers to the series of methods nested with reinforcement learning (RL) algorithms, which produces a self-generated sequence of tasks arising from the coupled population dynamics. By leveraging auto-curricula to induce a population of distinct emergent strategies, PB-MARL has achieved impressive success in tackling multi-agent tasks. Despite remarkable prior arts of distributed RL frameworks, PB-MARL poses new challenges for parallelizing the training frameworks due to the additional complexity of multiple nested workloads between sampling, training and evaluation involved with heterogeneous policy interactions. To solve these problems, we present MALib, a scalable and efficient computing framework for PB-MARL. Our framework is comprised of three key components: (1) a centralized task dispatching model, which supports the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Artificial Intelligence in Games
