Heterogeneous Anomaly Detection for Software Systems via Semi-supervised Cross-modal Attention
Cheryl Lee, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Yongqiang Yang,, Michael R. Lyu

TL;DR
This paper introduces Hades, a semi-supervised, cross-modal attention-based method that effectively detects system anomalies by integrating heterogeneous logs and metrics, outperforming existing single-data-type approaches.
Contribution
The paper presents the first end-to-end semi-supervised approach that fuses logs and metrics using cross-modal attention for anomaly detection in software systems.
Findings
Hades outperforms baseline methods on large-scale datasets.
Integrating logs and metrics improves anomaly detection accuracy.
The approach is effective on real-world cloud data.
Abstract
Prompt and accurate detection of system anomalies is essential to ensure the reliability of software systems. Unlike manual efforts that exploit all available run-time information, existing approaches usually leverage only a single type of monitoring data (often logs or metrics) or fail to make effective use of the joint information among different types of data. Consequently, many false predictions occur. To better understand the manifestations of system anomalies, we conduct a systematical study on a large amount of heterogeneous data, i.e., logs and metrics. Our study demonstrates that logs and metrics can manifest system anomalies collaboratively and complementarily, and neither of them only is sufficient. Thus, integrating heterogeneous data can help recover the complete picture of a system's health status. In this context, we propose Hades, the first end-to-end semi-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Anomaly Detection Techniques and Applications · Software Engineering Research
Methodsfail
