Multimodal Techniques for Malware Classification
Jonathan Jiang, Mark Stamp

TL;DR
This paper explores multimodal machine learning techniques for malware classification using features from different sections of Windows PE files, demonstrating improved accuracy over baseline models.
Contribution
It introduces a multimodal approach combining features from PE headers and sections, showing that separate models on different parts enhance malware classification performance.
Findings
Multimodal models outperform baseline models.
Training on separate PE file parts improves accuracy.
Combining features from multiple sections yields better results.
Abstract
The threat of malware is a serious concern for computer networks and systems, highlighting the need for accurate classification techniques. In this research, we experiment with multimodal machine learning approaches for malware classification, based on the structured nature of the Windows Portable Executable (PE) file format. Specifically, we train Support Vector Machine (SVM), Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) models on features extracted from PE headers, we train these same models on features extracted from the other sections of PE files, and train each model on features extracted from the entire PE file. We then train SVM models on each of the nine header-sections combinations of these baseline models, using the output layer probabilities of the component models as feature vectors. We compare the baseline cases to these multimodal combinations. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications
MethodsSupport Vector Machine
