ML-Based Behavioral Malware Detection Is Far From a Solved Problem
Yigitcan Kaya, Yizheng Chen, Marcus Botacin, Shoumik Saha, Fabio, Pierazzi, Lorenzo Cavallaro, David Wagner, Tudor Dumitras

TL;DR
This study reveals a significant performance gap between laboratory ML malware detectors trained on sandbox data and real-world endpoint detection, emphasizing the need for training on endpoint data for practical effectiveness.
Contribution
First measurement study evaluating ML malware detectors on real endpoint data, highlighting challenges and proposing techniques to improve detection performance.
Findings
Performance drops from over 90% in sandbox evaluations to 20-50% in real endpoint scenarios.
Challenges include label noise, distribution shift, and spurious features.
Training on endpoint data yields better real-world detection performance.
Abstract
Malware detection is a ubiquitous application of Machine Learning (ML) in security. In behavioral malware analysis, the detector relies on features extracted from program execution traces. The research literature has focused on detectors trained with features collected from sandbox environments and evaluated on samples also analyzed in a sandbox. However, in deployment, a malware detector at endpoint hosts often must rely on traces captured from endpoint hosts, not from a sandbox. Thus, there is a gap between the literature and real-world needs. We present the first measurement study of the performance of ML-based malware detectors at real-world endpoints. Leveraging a dataset of sandbox traces and a dataset of in-the-wild program traces, we evaluate two scenarios: (i) an endpoint detector trained on sandbox traces (convenient and easy to train), and (ii) an endpoint detector trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Digital and Cyber Forensics
