Malware Detection based on API calls
Christofer Fellicious, Manuel Bischof, Kevin Mayer, Dorian Eikenberg,, Stefan Hausotte, Hans P. Reiser, Michael Granitzer

TL;DR
This paper presents a lightweight, order-invariant machine learning approach for malware detection using API call analysis, demonstrating high accuracy with minimal computational overhead and providing a large public dataset.
Contribution
It introduces a novel order-invariant API call analysis method for malware detection, along with a comprehensive, publicly available dataset and open-source code.
Findings
Achieved over 85% F1-Score in malware detection
Effective detection using only calls to ntdll.dll
Developed a scalable, lightweight detection model
Abstract
Malware attacks pose a significant threat in today's interconnected digital landscape, causing billions of dollars in damages. Detecting and identifying families as early as possible provides an edge in protecting against such malware. We explore a lightweight, order-invariant approach to detecting and mitigating malware threats: analyzing API calls without regard to their sequence. We publish a public dataset of over three hundred thousand samples and their function call parameters for this task, annotated with labels indicating benign or malicious activity. The complete dataset is above 550GB uncompressed in size. We leverage machine learning algorithms, such as random forests, and conduct behavioral analysis by examining patterns and anomalies in API call sequences. By investigating how the function calls occur regardless of their order, we can identify discriminating features that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Information and Cyber Security
