Eliminating Backdoors in Neural Code Models for Secure Code   Understanding

Weisong Sun; Yuchen Chen; Chunrong Fang; Yebo Feng; Yuan; Xiao; An Guo; Quanjun Zhang; Yang Liu; Baowen Xu; Zhenyu Chen

arXiv:2408.04683·cs.CR·February 21, 2025

Eliminating Backdoors in Neural Code Models for Secure Code Understanding

Weisong Sun, Yuchen Chen, Chunrong Fang, Yebo Feng, Yuan, Xiao, An Guo, Quanjun Zhang, Yang Liu, Baowen Xu, Zhenyu Chen

PDF

Open Access

TL;DR

This paper introduces EliBadCode, a novel method to detect and eliminate backdoors in neural code models by inverting triggers and unlearning them, enhancing security in code understanding tasks.

Contribution

EliBadCode is the first approach to efficiently identify and remove backdoors in neural code models using trigger inversion and model unlearning techniques.

Findings

01

EliBadCode effectively removes backdoors in multiple neural code models.

02

The method minimally impacts the models' normal functionality.

03

EliBadCode outperforms existing defenses in backdoor elimination.

Abstract

Neural code models (NCMs) have been widely used to address various code understanding tasks, such as defect detection. However, numerous recent studies reveal that such models are vulnerable to backdoor attacks. Backdoored NCMs function normally on normal/clean code snippets, but exhibit adversary-expected behavior on poisoned code snippets injected with the adversary-crafted trigger. It poses a significant security threat. Therefore, there is an urgent need for effective techniques to detect and eliminate backdoors stealthily implanted in NCMs. To address this issue, in this paper, we innovatively propose a backdoor elimination technique for secure code understanding, called EliBadCode. EliBadCode eliminates backdoors in NCMs by inverting/reverse-engineering and unlearning backdoor triggers. Specifically, EliBadCode first filters the model vocabulary for trigger tokens based on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Advanced Malware Detection Techniques · Software Reliability and Analysis Research