TL;DR
This paper demonstrates that passive JavaScript tracking can accurately identify the underlying LLM-based web browsing agent with high precision, posing security risks and revealing vulnerabilities in agent anonymity.
Contribution
It introduces a passive fingerprinting method for LLM agents via UI traces, showing high accuracy and robustness across models, sizes, and environments.
Findings
Agents can be identified with up to 96% F1 score.
Classifiers generalize across different models and sizes.
Timing delays do not fully prevent fingerprinting.
Abstract
As LLM-based agents increasingly browse the web on users' behalf, a natural question arises: can websites passively identify which underlying model powers an agent? Doing so would represent a significant security risk, enabling targeted attacks tailored to known model vulnerabilities. Across 14 frontier LLMs and four web environments spanning information retrieval and shopping tasks, we show that an agent's actions and interaction timings, captured via a passive JavaScript tracker, are sufficient to identify the underlying model with up to 96\% F1. We formalise this attack surface by demonstrating that classifiers trained on agent actions generalise across model sizes and families. We further show that strong classifiers can be trained from few interaction traces and that agent identity can be inferred early within an episode. Injecting randomised timing delays between actions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
