Detecting Clusters of Anomalies on Low-Dimensional Feature Subsets with Application to Network Traffic Flow Data
Zhicong Qiu, David J. Miller, George Kesidis

TL;DR
This paper introduces a novel group anomaly detection method that identifies clusters of anomalous data in low-dimensional feature subsets, specifically applied to network traffic data for detecting malicious activity.
Contribution
The work develops a new GAD scheme that captures feature dependencies and jointly identifies anomalous groups and feature subsets, advancing beyond previous independence-based methods.
Findings
Effective detection of BotNet and P2P flow clusters in real network data.
Exploiting feature dependencies improves anomaly detection accuracy.
Outperforms methods assuming feature independence.
Abstract
In a variety of applications, one desires to detect groups of anomalous data samples, with a group potentially manifesting its atypicality (relative to a reference model) on a low-dimensional subset of the full measured set of features. Samples may only be weakly atypical individually, whereas they may be strongly atypical when considered jointly. What makes this group anomaly detection problem quite challenging is that it is a priori unknown which subset of features jointly manifests a particular group of anomalies. Moreover, it is unknown how many anomalous groups are present in a given data batch. In this work, we develop a group anomaly detection (GAD) scheme to identify the subset of samples and subset of features that jointly specify an anomalous cluster. We apply our approach to network intrusion detection to detect BotNet and peer-to-peer flow clusters. Unlike previous studies,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Internet Traffic Analysis and Secure E-voting
