Exploiting Latent Attack Semantics for Intelligent Malware Detection
Mkhail Kazdagli, Constantine Caramanis, Sanjay Shakkottai, Mohit, Tiwari

TL;DR
This paper introduces Shape GD, a novel malware detection system that leverages the structure and statistical shape of neighborhood-level features to improve early and robust detection of malware in noisy environments.
Contribution
Shape GD uniquely combines structural and statistical insights to aggregate weak local detectors into a robust global anomaly detector, enhancing early malware detection.
Findings
Detects malware early with high accuracy (~100 infected nodes in 100K system)
Achieves near-perfect true positive rate (~100%) and low false positive rate (~1%)
Outperforms prior methods by analyzing feature vector shapes rather than alert streams
Abstract
Behavioral malware detectors promise to expose previously unknown malware and are an important security primitive. However, even the best behavioral detectors suffer from high false positives and negatives. In this paper, we address the challenge of aggregating weak per-device behavioral detectors in noisy communities (i.e., ones that produce alerts at unpredictable rates) into an accurate and robust global anomaly detector (GD). Our system - Shape GD - combines two insights: Structural: actions such as visiting a website (waterhole attack) or membership in a shared email thread (phishing attack) by nodes correlate well with malware spread, and create dynamic neighborhoods of nodes that were exposed to the same attack vector; and Statistical: feature vectors corresponding to true and false positives of local detectors have markedly different conditional distributions. We use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Advanced Malware Detection Techniques · Spam and Phishing Detection
