Malfustection: Obfuscated Malware Detection and Malware Classification with Data Shortage by Combining Semi-Supervised and Contrastive Learning
Mohammad Mahdi Maghouli, Mohamadreza Fereydooni, Monireh Abdoos, and, Mojtaba Vahidi-Asl

TL;DR
This paper introduces a novel malware detection and classification method that converts malware code into images and employs semi-supervised and contrastive learning to effectively identify obfuscated malware despite limited data.
Contribution
It proposes a new approach combining image conversion, data augmentation, and contrastive learning to detect obfuscated malware with minimal labeled data.
Findings
Achieves 90.1% accuracy with only 10% training data.
Detects obfuscated malware with 96.21% accuracy when trained on non-obfuscated malware.
Addresses data shortage and obfuscation challenges in malware detection.
Abstract
With the advent of new technologies, using various formats of digital gadgets is becoming widespread. In today's world, where everyday tasks are inevitable without technology, this extensive use of computers paves the way for malicious activity. As a result, it is important to provide solutions to defend against these threats. Malware is one of the well-known and widely used means utilized for doing destructive activities by malicious attackers. Producing malware from scratch is somewhat difficult, so attackers tend to obfuscate existing malware and prepare it to become an unrecognizable program. Since creating new malware from an old one using obfuscation is a creative task, there are some drawbacks to identifying obfuscated malwares. In this research, we propose a solution to overcome this problem by converting the code to an image in the first step and then using a semi-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection
