IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection
Chia-Pei (Janet) Chen, Kentaroh Toyoda, Anita Lai, Alex Leung

TL;DR
This paper introduces IPI-proxy, an open-source tool that intercepts and rewrites web responses to test and improve the security of web-browsing AI agents against indirect prompt injection attacks.
Contribution
It provides a novel intercepting proxy with a library of attack payloads and a flexible evaluation framework for red-teaming AI agents in real-world web environments.
Findings
Embedded payloads can successfully trigger prompt injections in web-browsing AI agents.
The toolkit enables parameter-sweep evaluations without mock environments.
It bridges static benchmarks and live deployment for security assessment.
Abstract
Web-browsing AI agents are increasingly deployed in enterprise settings under strict whitelists of approved domains, yet adversaries can still influence them by embedding hidden instructions in the HTML pages those domains serve. Existing red-teaming resources fall short of this scenario: prompt-injection benchmarks ship pre-built adversarial pages that whitelisted agents cannot reach, and generic LLM scanners probe the model API rather than its retrieved content. We present IPI-proxy, an open-source toolkit for red-teaming web-browsing agents against indirect prompt injection (IPI). At its core is an intercepting proxy that rewrites real HTTP responses from whitelisted domains in flight, embedding payloads drawn from a unified library of 820 deduplicated attack strings extracted from six published benchmarks (BIPIA, InjecAgent, AgentDojo, Tensor Trust, WASP, and LLMail-Inject). A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
