Private Information Disclosure from Web Searches. (The case of Google Web History)
Claude Castelluccia, Emiliano De Cristofaro, Daniele Perito

TL;DR
This paper reveals vulnerabilities in Google’s web services that allow reconstruction of users' search histories through a novel attack, raising significant privacy concerns about mixed secure and insecure web architectures.
Contribution
Introduces the Historiographer, a new attack method to reconstruct Google users' search history despite protections, highlighting privacy risks in web service architectures.
Findings
Many Google services are vulnerable to session hijacking.
The Historiographer can reconstruct search history from personalized suggestions.
Attacks are applicable to other services with mixed security protocols.
Abstract
As the amount of personal information stored at remote service providers increases, so does the danger of data theft. When connections to remote services are made in the clear and authenticated sessions are kept using HTTP cookies, data theft becomes extremely easy to achieve. In this paper, we study the architecture of the world's largest service provider, i.e., Google. First, with the exception of a few services that can only be accessed over HTTPS (e.g., Gmail), we find that many Google services are still vulnerable to simple session hijacking. Next, we present the Historiographer, a novel attack that reconstructs the web search history of Google users, i.e., Google's Web History, even though such a service is supposedly protected from session hijacking by a stricter access control policy. The Historiographer uses a reconstruction technique inferring search history from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Spam and Phishing Detection · Data-Driven Disease Surveillance
