ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection

Shihao Weng; Yang Feng; Jinrui Zhang; Xiaofei Xie; Jiongchi Yu; Jia Liu

arXiv:2605.03378·cs.CR·May 6, 2026

ARGUS: Defending LLM Agents Against Context-Aware Prompt Injection

Shihao Weng, Yang Feng, Jinrui Zhang, Xiaofei Xie, Jiongchi Yu, Jia Liu

PDF

TL;DR

This paper introduces ARGUS, a provenance-based defense mechanism for LLM agents, and a benchmark, AgentLure, to evaluate context-aware prompt injection attacks, addressing limitations of prior defenses.

Contribution

The paper presents ARGUS, a novel provenance-aware auditing system, and AgentLure, a benchmark for context-dependent prompt injection attacks, improving security evaluation of LLM agents.

Findings

01

ARGUS reduces attack success rate to 3.8%.

02

ARGUS maintains 87.5% task utility.

03

Existing defenses perform poorly against context-aware attacks.

Abstract

The rise of Large Language Model (LLM) agents, augmented with tool use, skills, and external knowledge, has introduced new security risks. Among them, prompt injection attacks, where adversaries embed malicious instructions into the agent workflow, have emerged as the primary threat. However, existing benchmarks and defenses are fundamentally limited as they assume context-insensitive settings in which the agent works under a fully specified user instruction, and the attacks are straightforward and context-independent. As a result, they fail to capture real-world deployments where agent behavior usually depends on dynamic context, not just the user prompt, and adversaries can adapt their attacks to different context. Similarly, existing defenses built on this narrow threat model overlook the nature of real-world agent delegation. In this paper, we present AgentLure, a benchmark that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.