Multi-Source Anomaly Detection in Distributed IT Systems
Jasmin Bogatinovski, Sasho Nedelkoski

TL;DR
This paper proposes a joint representation approach using distributed traces and logs for anomaly detection in distributed IT systems, demonstrating improved detection accuracy and a new learning task called next template prediction.
Contribution
It introduces a novel joint embedding method for traces and logs, and formalizes the next template prediction task for enhanced anomaly detection in distributed systems.
Findings
Joint trace-log embeddings outperform single modality methods.
Next template prediction effectively models system behavior.
Embeddings can be reused for other applications.
Abstract
The multi-source data generated by distributed systems, provide a holistic description of the system. Harnessing the joint distribution of the different modalities by a learning model can be beneficial for critical applications for maintenance of the distributed systems. One such important task is the task of anomaly detection where we are interested in detecting the deviation of the current behaviour of the system from the theoretically expected. In this work, we utilize the joint representation from the distributed traces and system log data for the task of anomaly detection in distributed systems. We demonstrate that the joint utilization of traces and logs produced better results compared to the single modality anomaly detection methods. Furthermore, we formalize a learning task - next template prediction NTP, that is used as a generalization for anomaly detection for both logs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
