ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection
Shihao Weng, Yang Feng, Jinrui Zhang, Xiaofei Xie, Jiongchi Yu, Jia Liu

TL;DR
This paper introduces ARGUS, a provenance-based defense mechanism for LLM agents, and a benchmark, AgentLure, to evaluate context-aware prompt injection attacks, addressing limitations of prior defenses.
Contribution
The paper presents ARGUS, a novel provenance-aware auditing system, and AgentLure, a benchmark for context-dependent prompt injection attacks, improving security evaluation of LLM agents.
Findings
ARGUS reduces attack success rate to 3.8%.
ARGUS maintains 87.5% task utility.
Existing defenses perform poorly against context-aware attacks.
Abstract
The rise of Large Language Model (LLM) agents, augmented with tool use, skills, and external knowledge, has introduced new security risks. Among them, prompt injection attacks, where adversaries embed malicious instructions into the agent workflow, have emerged as the primary threat. However, existing benchmarks and defenses are fundamentally limited as they assume context-insensitive settings in which the agent works under a fully specified user instruction, and the attacks are straightforward and context-independent. As a result, they fail to capture real-world deployments where agent behavior usually depends on dynamic context, not just the user prompt, and adversaries can adapt their attacks to different context. Similarly, existing defenses built on this narrow threat model overlook the nature of real-world agent delegation. In this paper, we present AgentLure, a benchmark that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
