Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making

Stelios Triantafyllou; Aleksa Sukovic; Yasaman Zolfimoselo; Goran Radanovic

arXiv:2410.12539·cs.AI·October 22, 2025

Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making

Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo, Goran Radanovic

PDF

Open Access 1 Repo

TL;DR

This paper introduces a causal decomposition method for explaining how individual agents and state variables contribute to counterfactual outcomes in multi-agent sequential decision processes, enhancing interpretability.

Contribution

It proposes a novel causal explanation formula that decomposes counterfactual effects into agent-specific and state variable contributions using Shapley values and structure-preserving interventions.

Findings

01

Effective in a Gridworld environment with LLM agents

02

Applicable to complex scenarios like sepsis management

03

Improves interpretability of multi-agent decision effects

Abstract

We address the challenge of explaining counterfactual outcomes in multi-agent Markov decision processes. In particular, we aim to explain the total counterfactual effect of an agent's action on the outcome of a realized scenario through its influence on the environment dynamics and the agents' behavior. To achieve this, we introduce a novel causal explanation formula that decomposes the counterfactual effect by attributing to each agent and state variable a score reflecting their respective contributions to the effect. First, we show that the total counterfactual effect of an agent's action can be decomposed into two components: one measuring the effect that propagates through all subsequent agents' actions and another related to the effect that propagates through the state transitions. Building on recent advancements in causal contribution analysis, we further decompose these two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stelios30/cf-effect-decomposition
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Criteria Decision Making