Semantically-Equivalent Transformations-Based Backdoor Attacks against Neural Code Models: Characterization and Mitigation
Junyao Ye, Zhen Li, Xi Tang, Shouhuai Xu, Deqing Zou, and Zhongsheng Yuan

TL;DR
This paper introduces semantically-equivalent transformation backdoor attacks on neural code models, demonstrating their high success and stealthiness, and evaluates the limited effectiveness of existing defenses.
Contribution
It presents a novel SET-based backdoor attack framework that is more stealthy and resilient against defenses compared to traditional injection-based attacks.
Findings
SET-based attacks achieve over 90% success rate
They evade defenses with detection rates 25% lower
Normalization defenses offer only partial mitigation
Abstract
Neural code models have been increasingly incorporated into software development processes. However, their susceptibility to backdoor attacks presents a significant security risk. The state-of-the-art understanding focuses on injection-based attacks, which insert anomalous patterns into software code. These attacks can be neutralized by standard sanitization techniques. This status quo may lead to a false sense of security regarding backdoor attacks. In this paper, we introduce a new kind of backdoor attacks, dubbed Semantically-Equivalent Transformation (SET)-based backdoor attacks, which use semantics-preserving low-prevalence code transformations to generate stealthy triggers. We propose a framework to guide the generation of such triggers. Our experiments across five tasks, six languages, and models like CodeBERT, CodeT5, and StarCoder show that SET-based attacks achieve high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Software Engineering Research · Advanced Malware Detection Techniques
