MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants
John Heibel, Daniel Lowd

TL;DR
This paper introduces MaPPing Your Model, a prompt-based attack that subtly manipulates LLMs to introduce security vulnerabilities in generated code, demonstrating its effectiveness across various models and highlighting security risks.
Contribution
The paper presents the MaPP attack, a novel prompt manipulation technique that causes LLMs to add vulnerabilities, revealing security risks in LLM-based programming assistants.
Findings
MaPP prompts are effective across multiple LLMs.
Scaling models does not mitigate MaPP attack effectiveness.
MaPP can target specific vulnerabilities in generated code.
Abstract
LLM-based programming assistants offer the promise of programming faster but with the risk of introducing more security vulnerabilities. Prior work has studied how LLMs could be maliciously fine-tuned to suggest vulnerabilities more often. With the rise of agentic LLMs, which may use results from an untrusted third party, there is a growing risk of attacks on the model's prompt. We introduce the Malicious Programming Prompt (MaPP) attack, in which an attacker adds a small amount of text to a prompt for a programming task (under 500 bytes). We show that our prompt strategy can cause an LLM to add vulnerabilities while continuing to write otherwise correct code. We evaluate three prompts on seven common LLMs, from basic to state-of-the-art commercial models. Using the HumanEval benchmark, we find that our prompts are broadly effective, with no customization required for different LLMs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Security and Verification in Computing
