CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama, Kefan Su, Jiangxing Wang, Deheng Ye, Zongqing Lu

TL;DR
CORD introduces a hierarchical multi-agent reinforcement learning framework that enhances generalization by promoting role diversity, enabling agents to adapt to unseen collaborators without prior knowledge.
Contribution
The paper proposes a novel hierarchical MARL method, CORD, that maximizes role entropy with constraints to improve generalization in cooperative tasks.
Findings
CORD outperforms baselines in generalization tests.
Role diversity improves cooperation among agents.
Ablation studies confirm the effectiveness of the constrained objective.
Abstract
Cooperative multi-agent reinforcement learning (MARL) aims to develop agents that can collaborate effectively. However, most cooperative MARL methods overfit training agents, making learned policies not generalize well to unseen collaborators, which is a critical issue for real-world deployment. Some methods attempt to address the generalization problem but require prior knowledge or predefined policies of new teammates, limiting real-world applications. To this end, we propose a hierarchical MARL approach to enable generalizable cooperation via role diversity, namely CORD. CORD's high-level controller assigns roles to low-level agents by maximizing the role entropy with constraints. We show this constrained objective can be decomposed into causal influence in role that enables reasonable role assignment, and role heterogeneity that yields coherent, non-redundant role clusters.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications
