Semantically-Equivalent Transformations-Based Backdoor Attacks against Neural Code Models: Characterization and Mitigation

Junyao Ye; Zhen Li; Xi Tang; Shouhuai Xu; Deqing Zou; and Zhongsheng Yuan

arXiv:2512.19215·cs.SE·December 23, 2025

Semantically-Equivalent Transformations-Based Backdoor Attacks against Neural Code Models: Characterization and Mitigation

Junyao Ye, Zhen Li, Xi Tang, Shouhuai Xu, Deqing Zou, and Zhongsheng Yuan

PDF

Open Access

TL;DR

This paper introduces semantically-equivalent transformation backdoor attacks on neural code models, demonstrating their high success and stealthiness, and evaluates the limited effectiveness of existing defenses.

Contribution

It presents a novel SET-based backdoor attack framework that is more stealthy and resilient against defenses compared to traditional injection-based attacks.

Findings

01

SET-based attacks achieve over 90% success rate

02

They evade defenses with detection rates 25% lower

03

Normalization defenses offer only partial mitigation

Abstract

Neural code models have been increasingly incorporated into software development processes. However, their susceptibility to backdoor attacks presents a significant security risk. The state-of-the-art understanding focuses on injection-based attacks, which insert anomalous patterns into software code. These attacks can be neutralized by standard sanitization techniques. This status quo may lead to a false sense of security regarding backdoor attacks. In this paper, we introduce a new kind of backdoor attacks, dubbed Semantically-Equivalent Transformation (SET)-based backdoor attacks, which use semantics-preserving low-prevalence code transformations to generate stealthy triggers. We propose a framework to guide the generation of such triggers. Our experiments across five tasks, six languages, and models like CodeBERT, CodeT5, and StarCoder show that SET-based attacks achieve high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software Engineering Research · Advanced Malware Detection Techniques