mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture
Wei Zhang, Hongcheng Guo, Jian Yang, Zhoujin Tian, Yi Zhang, Chaoran, Yan, Zhoujun Li, Tongliang Li, Xu Shi, Liangfan Zheng, Bo Zhang

TL;DR
mABC introduces a multi-agent, blockchain-inspired framework utilizing specialized large language models to improve root cause analysis accuracy and reliability in complex micro-services architectures, addressing fault propagation and circular dependencies.
Contribution
This work pioneers a decentralized, multi-agent collaboration framework with blockchain-inspired voting and standardized workflows for effective root cause analysis in micro-services environments.
Findings
Superior root cause identification on AIOps and Train-Ticket datasets
Blockchain-inspired voting enhances analysis reliability
Ablation study confirms importance of each component
Abstract
Root cause analysis (RCA) in Micro-services architecture (MSA) with escalating complexity encounters complex challenges in maintaining system stability and efficiency due to fault propagation and circular dependencies among nodes. Diverse root cause analysis faults require multi-agents with diverse expertise. To mitigate the hallucination problem of large language models (LLMs), we design blockchain-inspired voting to ensure the reliability of the analysis by using a decentralized decision-making process. To avoid non-terminating loops led by common circular dependency in MSA, we objectively limit steps and standardize task processing through Agent Workflow. We propose a pioneering framework, multi-Agent Blockchain-inspired Collaboration for root cause analysis in micro-services architecture (mABC), where multiple agents based on the powerful LLMs follow Agent Workflow and collaborate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlockchain Technology Applications and Security · Cloud Computing and Resource Management · Software System Performance and Reliability
