Malicious Overtones: hunting data theft in the frequency domain with one-class learning
Brian A. Powell

TL;DR
This paper presents a frequency domain one-class learning approach for detecting long-term data exfiltration in network traffic, achieving low false positives by modeling traffic features in a fixed-size vector space.
Contribution
It introduces a modular ensemble of one-class classifiers trained on frequency domain features to identify malicious data theft over extended periods.
Findings
Achieves less than 2% false positive rate on various system types.
Effectively detects exfiltration with diverse timing and data characteristics.
Frequency domain features enable fixed-size representation of variable-length flows.
Abstract
A method for detecting electronic data theft from computer networks is described, capable of recognizing patterns of remote exfiltration occurring over days to weeks. Normal traffic flow data, in the form of a host's ingress and egress bytes over time, is used to train an ensemble of one-class learners. The detection ensemble is modular, with individual classifiers trained on different traffic features thought to characterize malicious data transfers. We select features that model the egress to ingress byte balance over time, periodicity, short time-scale irregularity, and density of the traffic. The features are most efficiently modeled in the frequency domain, which has the added benefit that variable duration flows are transformed to a fixed-size feature vector, and by sampling the frequency space appropriately, long-duration flows can be tested. When trained on days- or weeks-worth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Internet Traffic Analysis and Secure E-voting · Advanced Malware Detection Techniques
