MaPPing Your Model: Assessing the Impact of Adversarial Attacks on   LLM-based Programming Assistants

John Heibel; Daniel Lowd

arXiv:2407.11072·cs.CR·July 17, 2024

MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants

John Heibel, Daniel Lowd

PDF

Open Access

TL;DR

This paper introduces MaPPing Your Model, a prompt-based attack that subtly manipulates LLMs to introduce security vulnerabilities in generated code, demonstrating its effectiveness across various models and highlighting security risks.

Contribution

The paper presents the MaPP attack, a novel prompt manipulation technique that causes LLMs to add vulnerabilities, revealing security risks in LLM-based programming assistants.

Findings

01

MaPP prompts are effective across multiple LLMs.

02

Scaling models does not mitigate MaPP attack effectiveness.

03

MaPP can target specific vulnerabilities in generated code.

Abstract

LLM-based programming assistants offer the promise of programming faster but with the risk of introducing more security vulnerabilities. Prior work has studied how LLMs could be maliciously fine-tuned to suggest vulnerabilities more often. With the rise of agentic LLMs, which may use results from an untrusted third party, there is a growing risk of attacks on the model's prompt. We introduce the Malicious Programming Prompt (MaPP) attack, in which an attacker adds a small amount of text to a prompt for a programming task (under 500 bytes). We show that our prompt strategy can cause an LLM to add vulnerabilities while continuing to write otherwise correct code. We evaluate three prompts on seven common LLMs, from basic to state-of-the-art commercial models. Using the HumanEval benchmark, we find that our prompts are broadly effective, with no customization required for different LLMs.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Software Engineering Research · Security and Verification in Computing