LLMalMorph: On The Feasibility of Generating Variant Malware using Large-Language-Models
Md Ajwad Akil, Adrian Shuai Li, Imtiaz Karim, Arun Iyengar, Ashish Kundu, Vinny Parla, Elisa Bertino

TL;DR
This paper investigates how large language models can be used to generate malware variants by modifying source code, demonstrating effective evasion of antivirus and ML-based detectors with a semi-automated framework.
Contribution
Introduces LLMalMorph, a novel semi-automated framework leveraging LLMs for generating malware variants without fine-tuning, highlighting its effectiveness and limitations.
Findings
Achieved 10-15% reduction in antivirus detection rates.
Attained up to 91% success rate against ML-based malware detectors.
Generated 618 malware variants from 10 diverse samples.
Abstract
Large Language Models (LLMs) have transformed software development and automated code generation. Motivated by these advancements, this paper explores the feasibility of LLMs in modifying malware source code to generate variants. We introduce LLMalMorph, a semi-automated framework that leverages semantical and syntactical code comprehension by LLMs to generate new malware variants. LLMalMorph extracts function-level information from the malware source code and employs custom-engineered prompts coupled with strategically defined code transformations to guide the LLM in generating variants without resource-intensive fine-tuning. To evaluate LLMalMorph, we collected 10 diverse Windows malware samples of varying types, complexity and functionality and generated 618 variants. Our experiments demonstrate that LLMalMorph variants can effectively evade antivirus engines, achieving typical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Information and Cyber Security · Digital and Cyber Forensics
