Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
B\'alint Gyevn\'ar, Christopher G. Lucas, Stefano V. Albrecht, Shay B. Cohen

TL;DR
This paper introduces AXIS, a novel method that uses counterfactual simulations and large language models to generate human-centered explanations for multi-agent systems, improving trust and understanding.
Contribution
AXIS is the first approach to leverage interrogative simulation with LLMs for explaining multi-agent policies in complex environments.
Findings
AXIS improves explanation correctness by at least 7.7%.
AXIS increases goal prediction accuracy by 23%.
AXIS achieves the highest scores in explanation quality.
Abstract
Autonomous multi-agent systems (MAS) are useful for automating complex tasks but raise trust concerns due to risks such as miscoordination or goal misalignment. Explainability is vital for users' trust calibration, but explainable MAS face challenges due to complex environments, the human factor, and non-standardised evaluation. Leveraging the counterfactual effect size model and LLMs, we propose Agentic eXplanations via Interrogative Simulation (AXIS). AXIS generates human-centred action explanations for multi-agent policies by having an LLM interrogate an environment simulator using prompts like 'whatif' and 'remove' to observe and synthesise counterfactual information over multiple rounds. We evaluate AXIS on autonomous driving across ten scenarios for five LLMs with a comprehensive methodology combining robustness, subjective preference, correctness, and goal/action prediction with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMixing Adam and SGD
