A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection
Xiao Li, Yue Li, Hao Wu, Yue Zhang, Yechao Zhang, Fengyuan Xu, Sheng Zhong

TL;DR
This paper systematically categorizes and evaluates various code obfuscation techniques across multiple programming languages and LLM models to understand their impact on vulnerability detection.
Contribution
It provides a comprehensive classification of obfuscation methods and a unified framework for assessing their effects on LLM-based vulnerability detection.
Findings
Obfuscation can both improve and degrade LLM detection performance.
Different obfuscation techniques have varied impacts depending on vulnerability and code properties.
The study highlights open challenges for making LLMs more robust against obfuscated code.
Abstract
As large language models (LLMs) are increasingly adopted for code vulnerability detection, their reliability and robustness across diverse vulnerability types have become a pressing concern. In traditional adversarial settings, code obfuscation has long been used as a general strategy to bypass auditing tools, preserving exploitability without tampering with the tools themselves. Numerous efforts have explored obfuscation methods and tools, yet their capabilities differ in terms of supported techniques, granularity, and programming languages, making it difficult to systematically assess their impact on LLM-based vulnerability detection. To address this gap, we provide a structured systematization of obfuscation techniques and evaluate them under a unified framework. Specifically, we categorize existing obfuscation methods into three major classes (layout, data flow, and control flow)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Security and Verification in Computing
