Collecting Telemetry Data Privately
Bolin Ding, Janardhan Kulkarni, Sergey Yekhanin

TL;DR
This paper introduces new locally differentially private mechanisms for repeated collection of counter data, maintaining strong privacy guarantees over time while achieving accuracy comparable to single-round methods, validated through real-world datasets.
Contribution
The paper presents novel LDP algorithms specifically designed for repeated data collection, ensuring formal privacy guarantees over long periods, and demonstrates their effectiveness and deployment at scale.
Findings
Mechanisms achieve accuracy comparable to single-round LDP methods.
Empirical results confirm theoretical privacy and accuracy guarantees.
Deployed by Microsoft for telemetry collection on millions of devices.
Abstract
The collection and analysis of telemetry data from users' devices is routinely performed by many software companies. Telemetry collection leads to improved user experience but poses significant risks to users' privacy. Locally differentially private (LDP) algorithms have recently emerged as the main tool that allows data collectors to estimate various population statistics, while preserving privacy. The guarantees provided by such algorithms are typically very strong for a single round of telemetry collection, but degrade rapidly when telemetry is collected regularly. In particular, existing LDP algorithms are not suitable for repeated collection of counter data such as daily app usage statistics. In this paper, we develop new LDP mechanisms geared towards repeated collection of counter data, with formal privacy guarantees even after being executed for an arbitrarily long period of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Quality and Management · Mobile Crowdsensing and Crowdsourcing
