Am I Rare? An Intelligent Summarization Approach for Identifying Hidden Anomalies
Samira Ghodratnama, Mehrdad Zakershahrak, Fariborz Sobhanmanesh

TL;DR
This paper introduces INSIDENT, an intelligent clustering-based summarization method that preserves data distribution to effectively identify hidden anomalies in network traffic, reducing computational costs while maintaining detection accuracy.
Contribution
The paper presents a novel clustering-based summarization approach that maintains original data distribution, enhancing anomaly detection effectiveness in summarized network traffic data.
Findings
Summarized data can substitute original data in anomaly detection.
INSIDENT preserves data distribution and improves anomaly detection accuracy.
Experimental results on benchmark datasets validate the approach's effectiveness.
Abstract
Monitoring network traffic data to detect any hidden patterns of anomalies is a challenging and time-consuming task that requires high computing resources. To this end, an appropriate summarization technique is of great importance, where it can be a substitute for the original data. However, the summarized data is under the threat of removing anomalies. Therefore, it is vital to create a summary that can reflect the same pattern as the original data. Therefore, in this paper, we propose an INtelligent Summarization approach for IDENTifying hidden anomalies, called INSIDENT. The proposed approach guarantees to keep the original data distribution in summarized data. Our approach is a clustering-based algorithm that dynamically maps original feature space to a new feature space by locally weighting features in each cluster. Therefore, in new feature space, similar samples are closer, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
