MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation

Lu Yan; Zhuo Zhang; Xiangzhe Xu; Shengwei An; Guangyu Shen; Zhou Xuan; Xuan Chen; Xiangyu Zhang

arXiv:2507.02057·cs.CR·July 4, 2025

MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation

Lu Yan, Zhuo Zhang, Xiangzhe Xu, Shengwei An, Guangyu Shen, Zhou Xuan, Xuan Chen, Xiangyu Zhang

PDF

TL;DR

This paper presents MGC, a framework that exploits vulnerabilities in aligned LLMs to generate malware by decomposing malicious tasks into benign sub-tasks, revealing security risks in current alignment safeguards.

Contribution

MGC introduces a novel modular decomposition approach and a malware-specific intermediate representation to systematically generate malware from aligned LLMs, exposing security vulnerabilities.

Findings

01

Outperforms existing jailbreak methods by +365.79% in correctness

02

Successfully reproduces and enhances 16 real-world malware samples

03

Demonstrates reliable malware generation across diverse tasks and datasets

Abstract

Large language models (LLMs) have democratized software development, reducing the expertise barrier for programming complex applications. This accessibility extends to malicious software development, raising significant security concerns. While LLM providers have implemented alignment mechanisms to prevent direct generation of overtly malicious code, these safeguards predominantly evaluate individual prompts in isolation, overlooking a critical vulnerability: malicious operations can be systematically decomposed into benign-appearing sub-tasks. In this paper, we introduce the Malware Generation Compiler (MGC), a novel framework that leverages this vulnerability through modular decomposition and alignment-evasive generation. MGC employs a specialized Malware Description Intermediate Representation (MDIR) to bridge high-level malicious intents and benign-appearing code snippets. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.