Multimodal Techniques for Malware Classification

Jonathan Jiang; Mark Stamp

arXiv:2501.10956·cs.CR·January 22, 2025

Multimodal Techniques for Malware Classification

Jonathan Jiang, Mark Stamp

PDF

Open Access

TL;DR

This paper explores multimodal machine learning techniques for malware classification using features from different sections of Windows PE files, demonstrating improved accuracy over baseline models.

Contribution

It introduces a multimodal approach combining features from PE headers and sections, showing that separate models on different parts enhance malware classification performance.

Findings

01

Multimodal models outperform baseline models.

02

Training on separate PE file parts improves accuracy.

03

Combining features from multiple sections yields better results.

Abstract

The threat of malware is a serious concern for computer networks and systems, highlighting the need for accurate classification techniques. In this research, we experiment with multimodal machine learning approaches for malware classification, based on the structured nature of the Windows Portable Executable (PE) file format. Specifically, we train Support Vector Machine (SVM), Long Short-Term Memory (LSTM), and Convolutional Neural Network (CNN) models on features extracted from PE headers, we train these same models on features extracted from the other sections of PE files, and train each model on features extracted from the entire PE file. We then train SVM models on each of the nine header-sections combinations of these baseline models, using the output layer probabilities of the component models as feature vectors. We compare the baseline cases to these multimodal combinations. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications

MethodsSupport Vector Machine