Locality-Sensitive Hashing Does Not Guarantee Privacy! Attacks on Google's FLoC and the MinHash Hierarchy System
Florian Turati (1), Carlos Cotrini (1), Karel Kubicek (1), David Basin, (1) ((1) ETH Zurich)

TL;DR
This paper demonstrates that locality-sensitive hashing systems like Google's FLoC and MinHash Hierarchy are vulnerable to attacks that compromise user privacy, refuting their claims of privacy guarantees.
Contribution
It provides concrete attack methods against FLoC and MinHash, showing they do not ensure pre-image resistance, anonymity, or differential privacy.
Findings
Deanonymized 30% of FLoC users using Sybil attacks.
Reconstructed over 10% of browsing history for some users.
Limited geographic movement to 10% using MinHash analysis.
Abstract
Recently proposed systems aim at achieving privacy using locality-sensitive hashing. We show how these approaches fail by presenting attacks against two such systems: Google's FLoC proposal for privacy-preserving targeted advertising and the MinHash Hierarchy, a system for processing mobile users' traffic behavior in a privacy-preserving way. Our attacks refute the pre-image resistance, anonymity, and privacy guarantees claimed for these systems. In the case of FLoC, we show how to deanonymize users using Sybil attacks and to reconstruct 10% or more of the browsing history for 30% of its users using Generative Adversarial Networks. We achieve this only analyzing the hashes used by FLoC. For MinHash, we precisely identify the movement of a subset of individuals and, on average, we can limit users' movement to just 10% of the possible geographic area, again using just the hashes. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInternet Traffic Analysis and Secure E-voting · Privacy, Security, and Data Protection · Human Mobility and Location-Based Analysis
