Malware Detection Through Memory Analysis
Sarah Nassar

TL;DR
This study demonstrates that machine learning, specifically XGBoost, can effectively and efficiently detect malware using memory analysis data, achieving near-perfect accuracy in binary classification and high speed for real-time applications.
Contribution
The paper evaluates XGBoost for malware detection using memory analysis, showing its high accuracy and speed, advancing real-time cybersecurity defenses.
Findings
Binary classifier accuracy: 99.98%
Multi-class accuracy: 87.54%
Classification speed: ~37-43 milliseconds for 50 samples
Abstract
This paper summarizes the research conducted for a malware detection project using the Canadian Institute for Cybersecurity's MalMemAnalysis-2022 dataset. The purpose of the project was to explore the effectiveness and efficiency of machine learning techniques for the task of binary classification (i.e., benign or malicious) as well as multi-class classification to further include three malware sub-types (i.e., benign, ransomware, spyware, or Trojan horse). The XGBoost model type was the final model selected for both tasks due to the trade-off between strong detection capability and fast inference speed. The binary classifier achieved a testing subset accuracy and F1 score of 99.98\%, while the multi-class version reached an accuracy of 87.54\% and an F1 score of 81.26\%, with an average F1 score over the malware sub-types of 75.03\%. In addition to the high modelling performance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Adversarial Robustness in Machine Learning
