Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network
Ritik Mehta, Olha Jureckova, Mark Stamp

TL;DR
This paper introduces a hybrid HMM-CNN model for malware classification that captures sequential opcode patterns and hierarchical features, achieving superior accuracy on the Malicia dataset compared to existing methods.
Contribution
The paper presents a novel hybrid architecture combining HMM and CNN for malware classification, improving detection performance over previous models like HMM-Random Forest.
Findings
Outperforms HMM-Random Forest on Malicia dataset
Effectively captures sequential opcode patterns and hierarchical features
Demonstrates potential for enhanced cybersecurity malware detection
Abstract
The proliferation of malware variants poses a significant challenges to traditional malware detection approaches, such as signature-based methods, necessitating the development of advanced machine learning techniques. In this research, we present a novel approach based on a hybrid architecture combining features extracted using a Hidden Markov Model (HMM), with a Convolutional Neural Network (CNN) then used for malware classification. Inspired by the strong results in previous work using an HMM-Random Forest model, we propose integrating HMMs, which serve to capture sequential patterns in opcode sequences, with CNNs, which are adept at extracting hierarchical features. We demonstrate the effectiveness of our approach on the popular Malicia dataset, and we obtain superior performance, as compared to other machine learning methods -- our results surpass the aforementioned HMM-Random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications
