Multi-Aspect Mining and Anomaly Detection for Heterogeneous Tensor Streams
Soshi Kakio, Yasuko Matsubara, Ren Fujiwara, Yasushi Sakurai

TL;DR
HeteroComp is a novel method for real-time analysis and group anomaly detection in heterogeneous tensor streams, effectively handling both categorical and continuous data without discretization, and capturing temporal dynamics.
Contribution
This paper introduces HeteroComp, a new approach that models heterogeneous tensor streams with Gaussian processes to improve anomaly detection accuracy and efficiency.
Findings
Outperforms state-of-the-art algorithms in group anomaly detection.
Maintains computational efficiency regardless of data stream length.
Effectively models both categorical and continuous attributes without discretization.
Abstract
Analysis and anomaly detection in event tensor streams consisting of timestamps and multiple attributes - such as communication logs(time, IP address, packet length)- are essential tasks in data mining. While existing tensor decomposition and anomaly detection methods provide useful insights, they face the following two limitations. (i) They cannot handle heterogeneous tensor streams, which comprises both categorical attributes(e.g., IP address) and continuous attributes(e.g., packet length). They typically require either discretizing continuous attributes or treating categorical attributes as continuous, both of which distort the underlying statistical properties of the data.Furthermore, incorrect assumptions about the distribution family of continuous attributes often degrade the model's performance. (ii) They discretize timestamps, failing to track the temporal dynamics of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Time Series Analysis and Forecasting · Software System Performance and Reliability
