Containerized Distributed Value-Based Multi-Agent Reinforcement Learning

Siyang Wu; Tonghan Wang; Chenghao Li; Yang Hu; Chongjie Zhang

arXiv:2110.08169·cs.LG·December 6, 2021

Containerized Distributed Value-Based Multi-Agent Reinforcement Learning

Siyang Wu, Tonghan Wang, Chenghao Li, Yang Hu, Chongjie Zhang

PDF

Open Access

TL;DR

This paper introduces a containerized distributed multi-agent reinforcement learning framework that enhances scalability, diversity, and efficiency, successfully applying it to complex environments like Google Research Football and StarCraft II.

Contribution

It presents a novel containerized architecture with multi-queue management for scalable, diverse, and efficient distributed MARL, achieving state-of-the-art results in complex benchmarks.

Findings

01

Achieved 4-18x better results than non-distributed MARL algorithms on StarCraft II.

02

First to successfully solve Google Research Football full game 5v5.

03

Developed a scalable, time-efficient distributed MARL framework.

Abstract

Multi-agent reinforcement learning tasks put a high demand on the volume of training samples. Different from its single-agent counterpart, distributed value-based multi-agent reinforcement learning faces the unique challenges of demanding data transfer, inter-process communication management, and high requirement of exploration. We propose a containerized learning framework to solve these problems. We pack several environment instances, a local learner and buffer, and a carefully designed multi-queue manager which avoids blocking into a container. Local policies of each container are encouraged to be as diverse as possible, and only trajectories with highest priority are sent to a global learner. In this way, we achieve a scalable, time-efficient, and diverse distributed MARL learning framework with high system throughput. To own knowledge, our method is the first to solve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing