WhoTracks .Me: Shedding light on the opaque world of online tracking
Arjaldo Karaj, Sam Macbeth, R\'emi Berson, Josep M. Pujol

TL;DR
This paper introduces a large-scale, privacy-preserving measurement method for online tracking using a browser extension, providing the most comprehensive real-world data on tracking behaviors over a year.
Contribution
It presents a novel browser extension-based measurement approach and deploys it to over 5 million users, enabling accurate, long-term tracking analysis across diverse environments.
Findings
Largest dataset of real-world tracking over 1.5 billion page loads
Tracking behaviors vary significantly across countries and ISPs
Provides transparency and insights into online tracking practices
Abstract
Online tracking has become of increasing concern in recent years, however our understanding of its extent to date has been limited to snapshots from web crawls. Previous at-tempts to measure the tracking ecosystem, have been done using instrumented measurement platforms, which are not able to accurately capture how people interact with the web. In this work we present a method for the measurement of tracking in the web through a browser extension, as well as a method for the aggregation and collection of this information which protects the privacy of participants. We deployed this extension to more than 5 million users, enabling measurement across multiple countries, ISPs and browser configurations, to give an accurate picture of real-world tracking. The result is the largest and longest measurement of online tracking to date based on real users, covering 1.5 billion page loads gathered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy, Security, and Data Protection · Privacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing
