CTDS: Centralized Teacher with Decentralized Student for Multi-Agent   Reinforcement Learning

Jian Zhao; Xunhan Hu; Mingyu Yang; Wengang Zhou; Jiangcheng Zhu and; Houqiang Li

arXiv:2203.08412·cs.MA·March 17, 2022·1 cites

CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning

Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu and, Houqiang Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces CTDS, a novel framework for multi-agent reinforcement learning that uses a centralized teacher to improve decentralized student performance, enhancing global information utilization during training.

Contribution

The paper proposes a new CTDS framework that combines centralized teaching with decentralized execution, improving upon existing CTDE methods in MARL.

Findings

01

CTDS outperforms existing value-based MARL methods in StarCraft II tasks.

02

The framework effectively balances global observation use during training and decentralized inference.

03

Experimental results demonstrate improved learning efficiency and performance.

Abstract

Due to the partial observability and communication constraints in many multi-agent reinforcement learning (MARL) tasks, centralized training with decentralized execution (CTDE) has become one of the most widely used MARL paradigms. In CTDE, centralized information is dedicated to learning the allocation of the team reward with a mixing network, while the learning of individual Q-values is usually based on local observations. The insufficient utility of global observation will degrade performance in challenging environments. To this end, this work proposes a novel Centralized Teacher with Decentralized Student (CTDS) framework, which consists of a teacher model and a student model. Specifically, the teacher model allocates the team reward by learning individual Q-values conditioned on global observation, while the student model utilizes the partial observations to approximate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cathyhxh/ctds
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Open Source Software Innovations · Mobile Crowdsensing and Crowdsourcing