Computer activity learning from system call time series

Curt Hastings; Ronnie Mainieri

arXiv:1711.02088·cs.CR·November 8, 2017

Computer activity learning from system call time series

Curt Hastings, Ronnie Mainieri

PDF

Open Access

TL;DR

This paper presents a deep learning-based classifier for computer activity recognition from system call time series, achieving high accuracy in malware classification and demonstrating scalability and practical utility.

Contribution

Introduces a novel deep learning approach using a similarity function for system call streams, outperforming existing malware classifiers and providing scalable, real-world performance metrics.

Findings

01

Achieves F1 score of 0.995 on malware classification

02

System scales linearly with number of endpoints

03

Estimates 3450 malware families over 10 years

Abstract

Using a previously introduced similarity function for the stream of system calls generated by a computer, we engineer a program-in-execution classifier using deep learning methods. Tested on malware classification, it significantly outperforms current state of the art. We provide a series of performance measures and tests to demonstrate the capabilities, including measurements from production use. We show how the system scales linearly with the number of endpoints. With the system we estimate the total number of malware families created over the last 10 years as 3450, in line with reasonable economic constraints. The more limited rate for new malware families than previously acknowledged implies that machine learning malware classifiers risk being tested on their training set; we achieve F1 = 0.995 in a test carefully designed to mitigate this risk.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection