MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit   Assignment

Ziyan Wang; Yali Du; Yudi Zhang; Meng Fang; Biwei Huang

arXiv:2312.03644·cs.LG·January 1, 2024·1 cites

MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment

Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang

PDF

Open Access

TL;DR

This paper introduces MACCA, a causal inference framework for offline multi-agent reinforcement learning that accurately assigns credit to individual agents by modeling causal relationships, improving interpretability and performance.

Contribution

MACCA is the first framework to model causal credit assignment in offline MARL using a Dynamic Bayesian Network, ensuring accurate and interpretable credit attribution.

Findings

01

MACCA outperforms state-of-the-art methods in offline MARL tasks.

02

MACCA improves performance when integrated with various offline MARL algorithms.

03

The causal structure and reward functions are identifiable under offline data conditions.

Abstract

Offline Multi-agent Reinforcement Learning (MARL) is valuable in scenarios where online interaction is impractical or risky. While independent learning in MARL offers flexibility and scalability, accurately assigning credit to individual agents in offline settings poses challenges because interactions with an environment are prohibited. In this paper, we propose a new framework, namely Multi-Agent Causal Credit Assignment (MACCA), to address credit assignment in the offline MARL setting. Our approach, MACCA, characterizing the generative process as a Dynamic Bayesian Network, captures relationships between environmental variables, states, actions, and rewards. Estimating this model on offline data, MACCA can learn each agent's contribution by analyzing the causal relationship of their individual rewards, ensuring accurate and interpretable credit assignment. Additionally, the modularity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Explainable Artificial Intelligence (XAI)