FDI: Attack Neural Code Generation Systems through User Feedback Channel

Zhensu Sun; Xiaoning Du; Xiapu Luo; Fu Song; David Lo; Li Li

arXiv:2408.04194·cs.SE·August 9, 2024

FDI: Attack Neural Code Generation Systems through User Feedback Channel

Zhensu Sun, Xiaoning Du, Xiapu Luo, Fu Song, David Lo, Li Li

PDF

Open Access 1 Repo

TL;DR

This paper reveals that user feedback channels in neural code generation systems are vulnerable to injection attacks, which can manipulate generated code to include vulnerabilities or malicious content, highlighting security risks.

Contribution

The paper systematically analyzes feedback mechanisms in neural code systems, demonstrating how they can be exploited through FDI attacks to inject malicious prompts or backdoors.

Findings

01

Feedback mechanisms are vulnerable to injection attacks.

02

FDI attacks can manipulate code generation to include vulnerabilities.

03

Stealthy attacks can introduce malicious code or spam.

Abstract

Neural code generation systems have recently attracted increasing attention to improve developer productivity and speed up software development. Typically, these systems maintain a pre-trained neural model and make it available to general users as a service (e.g., through remote APIs) and incorporate a feedback mechanism to extensively collect and utilize the users' reaction to the generated code, i.e., user feedback. However, the security implications of such feedback have not yet been explored. With a systematic study of current feedback mechanisms, we find that feedback makes these systems vulnerable to feedback data injection (FDI) attacks. We discuss the methodology of FDI attacks and present a pre-attack profiling strategy to infer the attack constraints of a targeted system in the black-box setting. We demonstrate two proof-of-concept examples utilizing the FDI attack surface to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

v587su/FDI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques