Am I Rare? An Intelligent Summarization Approach for Identifying Hidden   Anomalies

Samira Ghodratnama; Mehrdad Zakershahrak; Fariborz Sobhanmanesh

arXiv:2012.15755·cs.LG·December 21, 2021

Am I Rare? An Intelligent Summarization Approach for Identifying Hidden Anomalies

Samira Ghodratnama, Mehrdad Zakershahrak, Fariborz Sobhanmanesh

PDF

TL;DR

This paper introduces INSIDENT, an intelligent clustering-based summarization method that preserves data distribution to effectively identify hidden anomalies in network traffic, reducing computational costs while maintaining detection accuracy.

Contribution

The paper presents a novel clustering-based summarization approach that maintains original data distribution, enhancing anomaly detection effectiveness in summarized network traffic data.

Findings

01

Summarized data can substitute original data in anomaly detection.

02

INSIDENT preserves data distribution and improves anomaly detection accuracy.

03

Experimental results on benchmark datasets validate the approach's effectiveness.

Abstract

Monitoring network traffic data to detect any hidden patterns of anomalies is a challenging and time-consuming task that requires high computing resources. To this end, an appropriate summarization technique is of great importance, where it can be a substitute for the original data. However, the summarized data is under the threat of removing anomalies. Therefore, it is vital to create a summary that can reflect the same pattern as the original data. Therefore, in this paper, we propose an INtelligent Summarization approach for IDENTifying hidden anomalies, called INSIDENT. The proposed approach guarantees to keep the original data distribution in summarized data. Our approach is a clustering-based algorithm that dynamically maps original feature space to a new feature space by locally weighting features in each cluster. Therefore, in new feature space, similar samples are closer, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.