BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

Kaiyuan Zhang; Mark Tenenholtz; Kyle Polley; Jerry Ma; Denis Yarats; Ninghui Li

arXiv:2511.20597·cs.LG·November 26, 2025

BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents

Kaiyuan Zhang, Mark Tenenholtz, Kyle Polley, Jerry Ma, Denis Yarats, Ninghui Li

PDF

Open Access 1 Models 2 Datasets

TL;DR

This paper investigates prompt injection attacks on AI browser agents, creating a realistic benchmark and evaluating defenses, ultimately proposing a multi-layered strategy to enhance security in web-based AI systems.

Contribution

It introduces a comprehensive benchmark for prompt injection attacks in realistic HTML contexts and evaluates defenses, proposing a multi-layered security approach for AI browser agents.

Findings

01

Existing defenses have limited effectiveness against complex prompt injections.

02

The benchmark reveals vulnerabilities in current AI browser agents.

03

A multi-layered defense strategy improves security against prompt injection attacks.

Abstract

The integration of artificial intelligence (AI) agents into web browsers introduces security challenges that go beyond traditional web application threat models. Prior work has identified prompt injection as a new attack vector for web agents, yet the resulting impact within real-world environments remains insufficiently understood. In this work, we examine the landscape of prompt injection attacks and synthesize a benchmark of attacks embedded in realistic HTML payloads. Our benchmark goes beyond prior work by emphasizing injections that can influence real-world actions rather than mere text outputs, and by presenting attack payloads with complexity and distractor frequency similar to what real-world agents encounter. We leverage this benchmark to conduct a comprehensive empirical evaluation of existing defenses, assessing their effectiveness across a suite of frontier AI models. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
perplexity-ai/browsesafe
model· 375 dl· ♡ 45
375 dl♡ 45

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Application Security Vulnerabilities · Security and Verification in Computing · Advanced Malware Detection Techniques