Snoopy: A Webpage Fingerprinting Framework with Finite Query Model for Mass-Surveillance
Gargi Mitra, Prasanna Karthik Vairam, Sandip Saha, Nitin, Chandrachoodan, V. Kamakoti

TL;DR
Snoopy is a novel webpage fingerprinting framework designed for mass-surveillance that accurately identifies visited websites across diverse user behaviors while respecting query limits, using static analysis and resource size sequences.
Contribution
Snoopy introduces the first framework for large-scale webpage fingerprinting under finite query constraints, combining static analysis with resource size features for broad user context generalization.
Findings
Achieves ~90% accuracy across various websites and contexts.
Ensemble with ML techniques reaches ~97% accuracy under query limits.
Effective static analysis predicts variations from diverse browsing environments.
Abstract
Internet users are vulnerable to privacy attacks despite the use of encryption. Webpage fingerprinting, an attack that analyzes encrypted traffic, can identify the webpages visited by a user in a given website. Recent research works have been successful in demonstrating webpage fingerprinting attacks on individual users, but have been unsuccessful in extending their attack for mass-surveillance. The key challenges in performing mass-scale webpage fingerprinting arises from (i) the sheer number of combinations of user behavior and preferences to account for, and; (ii) the bound on the number of website queries imposed by the defense mechanisms (e.g., DDoS defense) deployed at the website. These constraints preclude the use of conventional data-intensive ML-based techniques. In this work, we propose Snoopy, a first-of-its-kind framework, that performs webpage fingerprinting for a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection · Spam and Phishing Detection
