Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition

Shunyu Liu; Yihe Zhou; Jie Song; Tongya Zheng; Kaixuan Chen; Tongtian; Zhu; Zunlei Feng; Mingli Song

arXiv:2211.12712·cs.LG·March 15, 2023

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition

Shunyu Liu, Yihe Zhou, Jie Song, Tongya Zheng, Kaixuan Chen, Tongtian, Zhu, Zunlei Feng, Mingli Song

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a contrastive identity-aware learning method to enhance agent diversity in multi-agent value decomposition, significantly improving cooperation and performance in cooperative MARL tasks.

Contribution

It proposes a novel contrastive learning approach to explicitly increase the distinguishability of agent credits in VD networks, addressing homogeneity issues in multi-agent systems.

Findings

01

Outperforms state-of-the-art methods on SMAC benchmarks

02

Enhances diversity and cooperation among agents

03

Compatible with various VD architectures

Abstract

Value Decomposition (VD) aims to deduce the contributions of agents for decentralized policies in the presence of only global rewards, and has recently emerged as a powerful credit assignment paradigm for tackling cooperative Multi-Agent Reinforcement Learning (MARL) problems. One of the main challenges in VD is to promote diverse behaviors among agents, while existing methods directly encourage the diversity of learned agent networks with various strategies. However, we argue that these dedicated designs for agent networks are still limited by the indistinguishable VD network, leading to homogeneous agent behaviors and thus downgrading the cooperation capability. In this paper, we propose a novel Contrastive Identity-Aware learning (CIA) method, explicitly boosting the credit-level distinguishability of the VD network to break the bottleneck of multi-agent diversity. Specifically, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liushunyu/cia
pytorchOfficial

Videos

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition· underline

Taxonomy

TopicsHealth Systems, Economic Evaluations, Quality of Life · Digital Platforms and Economics

MethodsContrastive Learning