IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection

Chia-Pei (Janet) Chen; Kentaroh Toyoda; Anita Lai; Alex Leung

arXiv:2605.11868·cs.CR·May 13, 2026

IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection

Chia-Pei (Janet) Chen, Kentaroh Toyoda, Anita Lai, Alex Leung

PDF

TL;DR

This paper introduces IPI-proxy, an open-source tool that intercepts and rewrites web responses to test and improve the security of web-browsing AI agents against indirect prompt injection attacks.

Contribution

It provides a novel intercepting proxy with a library of attack payloads and a flexible evaluation framework for red-teaming AI agents in real-world web environments.

Findings

01

Embedded payloads can successfully trigger prompt injections in web-browsing AI agents.

02

The toolkit enables parameter-sweep evaluations without mock environments.

03

It bridges static benchmarks and live deployment for security assessment.

Abstract

Web-browsing AI agents are increasingly deployed in enterprise settings under strict whitelists of approved domains, yet adversaries can still influence them by embedding hidden instructions in the HTML pages those domains serve. Existing red-teaming resources fall short of this scenario: prompt-injection benchmarks ship pre-built adversarial pages that whitelisted agents cannot reach, and generic LLM scanners probe the model API rather than its retrieved content. We present IPI-proxy, an open-source toolkit for red-teaming web-browsing agents against indirect prompt injection (IPI). At its core is an intercepting proxy that rewrites real HTTP responses from whitelisted domains in flight, embedding payloads drawn from a unified library of 820 deduplicated attack strings extracted from six published benchmarks (BIPIA, InjecAgent, AgentDojo, Tensor Trust, WASP, and LLMail-Inject). A…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.