From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration
Yizhe Xie, Congcong Zhu, Xinyue Zhang, Tianqing Zhu, Dayong Ye, Minfeng Qi, Huajie Chen, Wanlei Zhou

TL;DR
This paper models error propagation in LLM-based multi-agent systems, identifying vulnerabilities and proposing a genealogy-graph-based governance layer to effectively mitigate cascading errors without altering the core architecture.
Contribution
It introduces a propagation dynamics model and a novel governance mechanism that suppresses error cascades in multi-agent systems, enhancing robustness.
Findings
Error propagation can cause widespread failure from a single error seed.
The proposed governance layer prevents final infection in at least 89% of runs.
Identified three vulnerability classes: cascade amplification, topological sensitivity, and consensus inertia.
Abstract
Large Language Model-based Multi-Agent Systems (LLM-MAS) are increasingly applied to complex collaborative scenarios. However, their collaborative mechanisms may cause minor inaccuracies to gradually solidify into system-level false consensus through iteration. Such risks are difficult to trace since errors can propagate and amplify through message dependencies. Existing protections often rely on single-agent validation or require modifications to the collaboration architecture, which can weaken effective information flow and may not align with natural collaboration processes in real tasks. To address this, we propose a propagation dynamics model tailored for LLM-MAS that abstracts collaboration as a directed dependency graph and provides an early-stage risk criterion to characterize amplification risk. Through experiments on six mainstream frameworks, we identify three vulnerability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
