An Influence-based Approach for Root Cause Alarm Discovery in Telecom Networks
Keli Zhang, Marcus Kalander, Min Zhou, Xi Zhang, Junjian Ye

TL;DR
This paper presents a novel data-driven framework combining causal inference and network embedding to improve root cause alarm localization in telecom networks, addressing challenges of network complexity and alarm volume.
Contribution
It introduces a hybrid causal graph learning method (HPCI) and a Causal Propagation-Based Embedding (CPBE) algorithm, advancing root cause analysis with less expert knowledge required.
Findings
Significant improvement over baseline methods on real telecom data
Effective real-time root cause alarm discovery
Enhanced accuracy in complex network scenarios
Abstract
Alarm root cause analysis is a significant component in the day-to-day telecommunication network maintenance, and it is critical for efficient and accurate fault localization and failure recovery. In practice, accurate and self-adjustable alarm root cause analysis is a great challenge due to network complexity and vast amounts of alarms. A popular approach for failure root cause identification is to construct a graph with approximate edges, commonly based on either event co-occurrences or conditional independence tests. However, considerable expert knowledge is typically required for edge pruning. We propose a novel data-driven framework for root cause alarm localization, combining both causal inference and network embedding techniques. In this framework, we design a hybrid causal graph learning method (HPCI), which combines Hawkes Process with Conditional Independence tests, as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Data-Driven Disease Surveillance · Software System Performance and Reliability
