CoTDeceptor:Adversarial Code Obfuscation Against CoT-Enhanced LLM Code Agents

Haoyang Li; Mingjin Li; Jinxin Zuo; Siqi Li; Xiao Li; Hao Wu; Yueming Lu; Xiaochuan He

arXiv:2512.21250·cs.CR·December 25, 2025

CoTDeceptor:Adversarial Code Obfuscation Against CoT-Enhanced LLM Code Agents

Haoyang Li, Mingjin Li, Jinxin Zuo, Siqi Li, Xiao Li, Hao Wu, Yueming Lu, Xiaochuan He

PDF

Open Access

TL;DR

This paper introduces CoTDeceptor, an adversarial framework that effectively evades CoT-enhanced LLM vulnerability detectors by constructing complex obfuscation strategies, revealing significant security risks in software supply chains.

Contribution

We present the first adversarial obfuscation framework targeting CoT-enhanced LLM detectors, demonstrating its ability to bypass most vulnerability categories and exposing weaknesses in current security methods.

Findings

01

CoTDeceptor achieves high evasion success across multiple vulnerability categories.

02

It bypasses 14 out of 15 vulnerability categories, outperforming prior methods.

03

The framework reveals systematic weaknesses in CoT-based security detection.

Abstract

LLM-based code agents(e.g., ChatGPT Codex) are increasingly deployed as detector for code review and security auditing tasks. Although CoT-enhanced LLM vulnerability detectors are believed to provide improved robustness against obfuscated malicious code, we find that their reasoning chains and semantic abstraction processes exhibit exploitable systematic weaknesses.This allows attackers to covertly embed malicious logic, bypass code review, and propagate backdoored components throughout real-world software supply chains.To investigate this issue, we present CoTDeceptor, the first adversarial code obfuscation framework targeting CoT-enhanced LLM detectors. CoTDeceptor autonomously constructs evolving, hard-to-reverse multi-stage obfuscation strategy chains that effectively disrupt CoT-driven detection logic.We obtained malicious code provided by security enterprise, experimental results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Adversarial Robustness in Machine Learning