KGroot: Enhancing Root Cause Analysis through Knowledge Graphs and Graph Convolutional Neural Networks
Tingting Wang, Guilin Qi, Tianxing Wu

TL;DR
KGroot leverages knowledge graphs and graph convolutional neural networks to improve fault localization in micro-services, achieving high accuracy and real-time performance in industrial environments.
Contribution
This paper introduces KGroot, a novel method combining knowledge graphs and GCNs for automatic root cause analysis in micro-services, surpassing existing approaches in accuracy and efficiency.
Findings
Achieves 93.5% top 3 accuracy in fault localization
Operates in second-level for real-time diagnosis
Outperforms state-of-the-art baselines in effectiveness and efficiency
Abstract
Fault localization is challenging in online micro-service due to the wide variety of monitoring data volume, types, events and complex interdependencies in service and components. Faults events in services are propagative and can trigger a cascade of alerts in a short period of time. In the industry, fault localization is typically conducted manually by experienced personnel. This reliance on experience is unreliable and lacks automation. Different modules present information barriers during manual localization, making it difficult to quickly align during urgent faults. This inefficiency lags stability assurance to minimize fault detection and repair time. Though actionable methods aimed to automatic the process, the accuracy and efficiency are less than satisfactory. The precision of fault localization results is of paramount importance as it underpins engineers trust in the diagnostic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Brain Tumor Detection and Classification · Occupational Health and Safety Research
Methodstravel james · ALIGN
