MACIE: Multi-Agent Causal Intelligence Explainer for Collective Behavior Understanding
Abraham Itzhak Weinberg

TL;DR
MACIE is a comprehensive framework that explains multi-agent reinforcement learning behaviors by combining causal models, counterfactuals, and Shapley values, enabling detailed attribution and emergent behavior analysis.
Contribution
It introduces MACIE, a novel explainability framework that uniquely integrates causal inference, emergence metrics, and natural language explanations for multi-agent systems.
Findings
Accurate attribution of agent contributions with mean phi_i of 5.07
Detection of positive emergence in cooperative tasks
Efficient computation at 0.79 seconds per dataset
Abstract
As Multi Agent Reinforcement Learning systems are used in safety critical applications. Understanding why agents make decisions and how they achieve collective behavior is crucial. Existing explainable AI methods struggle in multi agent settings. They fail to attribute collective outcomes to individuals, quantify emergent behaviors, or capture complex interactions. We present MACIE Multi Agent Causal Intelligence Explainer, a framework combining structural causal models, interventional counterfactuals, and Shapley values to provide comprehensive explanations. MACIE addresses three questions. First, each agent's causal contribution using interventional attribution scores. Second, system level emergent intelligence through synergy metrics separating collective effects from individual contributions. Third, actionable explanations using natural language narratives synthesizing causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
