A Decompilation-Driven Framework for Malware Detection with Large Language Models
Aniesh Chawla, Udbhav Prasad

TL;DR
This paper explores using large language models, combined with decompilation, for malware detection, highlighting the potential and current limitations of LLMs in cybersecurity and emphasizing the need for ongoing fine-tuning.
Contribution
It introduces an automated decompilation-based pipeline for malware classification using LLMs and evaluates the impact of fine-tuning on detection performance.
Findings
Fine-tuned LLMs outperform vanilla models in malware detection.
Model performance degrades with newer malware variants.
Continuous fine-tuning is essential for maintaining effectiveness.
Abstract
The parallel evolution of Large Language Models (LLMs) with advanced code-understanding capabilities and the increasing sophistication of malware presents a new frontier for cybersecurity research. This paper evaluates the efficacy of state-of-the-art LLMs in classifying executable code as either benign or malicious. We introduce an automated pipeline that first decompiles Windows executable into a C code using Ghidra disassembler and then leverages LLMs to perform the classification. Our evaluation reveals that while standard LLMs show promise, they are not yet robust enough to replace traditional anti-virus software. We demonstrate that a fine-tuned model, trained on curated malware and benign datasets, significantly outperforms its vanilla counterpart. However, the performance of even this specialized model degrades notably when encountering newer malware. This finding demonstrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Digital and Cyber Forensics
