A Large Language Model Approach to Generating Bypass Rules for Malware Evasion in Analysis Sandbox

Zhiyong Sui; Lamine Noureddine; Mst Eshita Khatun; Sideeq Bello; Justin Woodring; Aisha Ali-Gombe

arXiv:2605.21821·cs.CR·May 22, 2026

A Large Language Model Approach to Generating Bypass Rules for Malware Evasion in Analysis Sandbox

Zhiyong Sui, Lamine Noureddine, Mst Eshita Khatun, Sideeq Bello, Justin Woodring, Aisha Ali-Gombe

PDF

TL;DR

This paper presents ABLE, a novel LLM-based system that automatically generates bypass rules for sandbox evasion in malware analysis, significantly improving detection and analysis of evasive malware.

Contribution

ABLE introduces an automated, scalable approach using large language models to generate and refine bypass rules, reducing reliance on manual reverse engineering.

Findings

01

Achieves 79% bypass success rate on real-world malware

02

Identifies 47% more malware families compared to existing platforms

03

Iterative refinement accounts for 29.5% of successful bypasses

Abstract

Sandbox evasion remains a critical challenge for automated malware analysis, as modern malware employs environment checks to detect analysis platforms and suppress malicious behavior. Existing approaches rely on manually crafted bypass rules that require deep reverse engineering of each evasion mechanism -an approach that cannot scale against rapidly evolving evasion techniques. In this paper, we leverage large language models (LLMs) to automatically generate YARA rules that bypass evasion checks in sandbox environments. We propose ABLE, which analyzes execution traces from malware terminated due to potentially evasive behavior and employs multiple reasoning strategies to generate targeted bypass rules. To address syntactic errors and improve the efficacy of the bypass rules in the LLM outputs, we introduce an auto-sanitization pipeline and feedback-driven iterative refinement. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.