The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Peiran Wang; Xinfeng Li; Chong Xiang; Jinghuai Zhang; Ying Li; Lixia Zhang; Xiaofeng Wang; Yuan Tian

arXiv:2602.10453·cs.CR·February 12, 2026

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis

Peiran Wang, Xinfeng Li, Chong Xiang, Jinghuai Zhang, Ying Li, Lixia Zhang, Xiaofeng Wang, Yuan Tian

PDF

Open Access 1 Datasets

TL;DR

This paper provides a comprehensive overview and analysis of prompt injection threats in LLM agents, introduces a new benchmark for context-dependent tasks, and evaluates existing defenses highlighting their limitations.

Contribution

It offers a taxonomy of prompt injection attacks and defenses, introduces AgentPI benchmark for context-aware evaluation, and empirically assesses defense effectiveness in realistic scenarios.

Findings

01

Many defenses fail to handle context-dependent tasks effectively.

02

Existing benchmarks often overlook real-world environmental interactions.

03

No single defense approach achieves optimal trustworthiness, utility, and latency.

Abstract

The evolution of Large Language Models (LLMs) has resulted in a paradigm shift towards autonomous agents, necessitating robust security against Prompt Injection (PI) vulnerabilities where untrusted inputs hijack agent behaviors. This SoK presents a comprehensive overview of the PI landscape, covering attacks, defenses, and their evaluation practices. Through a systematic literature review and quantitative analysis, we establish taxonomies that categorize PI attacks by payload generation strategies (heuristic vs. optimization) and defenses by intervention stages (text, model, and execution levels). Our analysis reveals a key limitation shared by many existing defenses and benchmarks: they largely overlook context-dependent tasks, in which agents are authorized to rely on runtime environmental observations to determine actions. To address this gap, we introduce AgentPI, a new benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

prodnull/prompt-injection-repo-dataset
dataset· 43 dl
43 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Advanced Malware Detection Techniques