To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt

Zhilong Wang; Neha Nagaraja; Lan Zhang; Hayretdin Bahsi; Pawan Patil; Peng Liu

arXiv:2506.05739·cs.CR·June 9, 2025

To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt

Zhilong Wang, Neha Nagaraja, Lan Zhang, Hayretdin Bahsi, Pawan Patil, Peng Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Polymorphic Prompt Assembling, a lightweight method that dynamically varies prompt structures to defend LLM agents against prompt injection attacks without significant performance loss.

Contribution

The paper presents a novel, dynamic prompt variation technique that effectively prevents prompt injection attacks with minimal overhead.

Findings

01

PPA significantly reduces success rate of prompt injection attacks.

02

PPA maintains high performance and usability of LLM agents.

03

Compared to existing defenses, PPA offers superior security with lower overhead.

Abstract

LLM agents are widely used as agents for customer support, content generation, and code assistance. However, they are vulnerable to prompt injection attacks, where adversarial inputs manipulate the model's behavior. Traditional defenses like input sanitization, guard models, and guardrails are either cumbersome or ineffective. In this paper, we propose a novel, lightweight defense mechanism called Polymorphic Prompt Assembling (PPA), which protects against prompt injection with near-zero overhead. The approach is based on the insight that prompt injection requires guessing and breaking the structure of the system prompt. By dynamically varying the structure of system prompts, PPA prevents attackers from predicting the prompt structure, thereby enhancing security without compromising performance. We conducted experiments to evaluate the effectiveness of PPA against existing attacks and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhilongwang/llmagentprotector
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning

MethodsIs Venmo Customer Support Available 24/7? How to Reach a Real Person