Causal Mean Field Multi-Agent Reinforcement Learning

Hao Ma; Zhiqiang Pu; Yi Pan; Boyin Liu; Junlong Gao; Zhenyu Guo

arXiv:2502.14200·cs.AI·February 21, 2025

Causal Mean Field Multi-Agent Reinforcement Learning

Hao Ma, Zhiqiang Pu, Yi Pan, Boyin Liu, Junlong Gao, Zhenyu Guo

PDF

TL;DR

This paper introduces CMFQ, a causality-aware algorithm that enhances the scalability of multi-agent reinforcement learning by modeling and quantifying essential interactions, demonstrating robustness in environments with many agents.

Contribution

The paper proposes a novel causal mean-field Q-learning framework that incorporates structural causal models to identify and leverage key interactions, improving scalability in multi-agent settings.

Findings

01

CMFQ outperforms existing methods in large-agent environments.

02

The approach maintains robustness despite nonstationary environments.

03

Effective in both cooperative and mixed game scenarios.

Abstract

Scalability remains a challenge in multi-agent reinforcement learning and is currently under active research. A framework named mean-field reinforcement learning (MFRL) could alleviate the scalability problem by employing the Mean Field Theory to turn a many-agent problem into a two-agent problem. However, this framework lacks the ability to identify essential interactions under nonstationary environments. Causality contains relatively invariant mechanisms behind interactions, though environments are nonstationary. Therefore, we propose an algorithm called causal mean-field Q-learning (CMFQ) to address the scalability problem. CMFQ is ever more robust toward the change of the number of agents though inheriting the compressed representation of MFRL's action-state space. Firstly, we model the causality behind the decision-making process of MFRL into a structural causal model (SCM). Then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning