Detecting Data Poisoning in Code Generation LLMs via Black-Box, Vulnerability-Oriented Scanning

Shenao Yan; Shimaa Ahmed; Shan Jin; Sunpreet S. Arora; Yiwei Cai; Yizhen Wang; Yuan Hong

arXiv:2603.17174·cs.CR·March 19, 2026

Detecting Data Poisoning in Code Generation LLMs via Black-Box, Vulnerability-Oriented Scanning

Shenao Yan, Shimaa Ahmed, Shan Jin, Sunpreet S. Arora, Yiwei Cai, Yizhen Wang, Yuan Hong

PDF

Open Access

TL;DR

This paper introduces CodeScan, a novel framework for detecting data poisoning in code generation large language models by analyzing structural similarities in generated code and applying vulnerability analysis, achieving over 97% detection accuracy.

Contribution

CodeScan is the first poisoning detection framework tailored for code generation models that uses structural analysis and vulnerability assessment to identify compromised models.

Findings

01

Achieves over 97% detection accuracy across multiple models and attack types.

02

Outperforms prior methods with lower false positive rates.

03

Effective against diverse backdoor and poisoning attacks.

Abstract

Code generation large language models (LLMs) are increasingly integrated into modern software development workflows. Recent work has shown that these models are vulnerable to backdoor and poisoning attacks that induce the generation of insecure code, yet effective defenses remain limited. Existing scanning approaches rely on token-level generation consistency to invert attack targets, which is ineffective for source code where identical semantics can appear in diverse syntactic forms. We present CodeScan, which, to the best of our knowledge, is the first poisoning-scanning framework tailored to code generation models. CodeScan identifies attack targets by analyzing structural similarities across multiple generations conditioned on different clean prompts. It combines iterative divergence analysis with abstract syntax tree (AST)-based normalization to abstract away surface-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation and Cyber Security · Software Engineering Research · Web Application Security Vulnerabilities