Effective Code Membership Inference for Code Completion Models via Adversarial Prompts

Yuan Jiang; Zehao Li; Shan Huang; Christoph Treude; Xiaohong Su; Tiantian Wang

arXiv:2511.15107·cs.SE·November 20, 2025

Effective Code Membership Inference for Code Completion Models via Adversarial Prompts

Yuan Jiang, Zehao Li, Shan Huang, Christoph Treude, Xiaohong Su, Tiantian Wang

PDF

Open Access

TL;DR

This paper introduces AdvPrompt-MIA, a novel adversarial prompt-based membership inference attack on code completion models that effectively detects training data membership with high accuracy and transferability.

Contribution

We propose AdvPrompt-MIA, a new method combining adversarial prompts and deep learning to improve membership inference on code models, surpassing existing techniques in accuracy and generalizability.

Findings

01

Outperforms state-of-the-art baselines with up to 102% AUC gain.

02

Demonstrates strong transferability across models and datasets.

03

Effectively captures nuanced memorization patterns in code models.

Abstract

Membership inference attacks (MIAs) on code completion models offer an effective way to assess privacy risks by inferring whether a given code snippet was part of the training data. Existing black- and gray-box MIAs rely on expensive surrogate models or manually crafted heuristic rules, which limit their ability to capture the nuanced memorization patterns exhibited by over-parameterized code language models. To address these challenges, we propose AdvPrompt-MIA, a method specifically designed for code completion models, combining code-specific adversarial perturbations with deep learning. The core novelty of our method lies in designing a series of adversarial prompts that induce variations in the victim code model's output. By comparing these outputs with the ground-truth completion, we construct feature vectors to train a classifier that automatically distinguishes member from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Software Engineering Research