Feature Extraction for Novelty Detection in Network Traffic
Kun Yang, Samory Kpotufe, Nick Feamster

TL;DR
This paper introduces an open-source framework and Python library for extracting features from network traffic to improve novelty detection, enabling easier exploration of different data representations and evaluation across various models.
Contribution
The authors develop and release a systematic toolkit and library that facilitate feature extraction and evaluation for novelty detection in network traffic, broadening the scope beyond traditional IPFIX/NetFlow representations.
Findings
Certain features are more effective for specific detection scenarios.
The framework enables comparison of different data representations and models.
Guidelines are provided for selecting features based on the detection context.
Abstract
Data representation plays a critical role in the performance of novelty detection (or ``anomaly detection'') methods in machine learning. The data representation of network traffic often determines the effectiveness of these models as much as the model itself. The wide range of novel events that network operators need to detect (e.g., attacks, malware, new applications, changes in traffic demands) introduces the possibility for a broad range of possible models and data representations. In each scenario, practitioners must spend significant effort extracting and engineering features that are most predictive for that situation or application. While anomaly detection is well-studied in computer networking, much existing work develops specific models that presume a particular representation -- often IPFIX/NetFlow. Yet, other representations may result in higher model accuracy, and the rise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Anomaly Detection Techniques and Applications · Software System Performance and Reliability
