MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems
Jin Jia, Zhiling Deng, Zhuangbin Chen, Yingqi Wang, and Zibin Zheng

TL;DR
MAS-FIRE introduces a comprehensive fault injection framework to evaluate and diagnose the reliability of LLM-based multi-agent systems, revealing insights into fault tolerance and the impact of architecture and model strength.
Contribution
This paper presents MAS-FIRE, a novel systematic framework for fault injection and detailed reliability analysis of multi-agent systems using large language models.
Findings
Stronger models do not always enhance robustness.
Architectural topology significantly influences fault tolerance.
Iterative, closed-loop designs mitigate over 40% of catastrophic faults.
Abstract
As LLM-based Multi-Agent Systems (MAS) are increasingly deployed for complex tasks, ensuring their reliability has become a pressing challenge. Since MAS coordinate through unstructured natural language rather than rigid protocols, they are prone to semantic failures (e.g., hallucinations, misinterpreted instructions, and reasoning drift) that propagate silently without raising runtime exceptions. Prevailing evaluation approaches, which measure only end-to-end task success, offer limited insight into how these failures arise or how effectively agents recover from them. To bridge this gap, we propose MAS-FIRE, a systematic framework for fault injection and reliability evaluation of MAS. We define a taxonomy of 15 fault types covering intra-agent cognitive errors and inter-agent coordination failures, and inject them via three non-invasive mechanisms: prompt modification, response…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · AI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge
