Calibrate: Frequency Estimation and Heavy Hitter Identification with Local Differential Privacy via Incorporating Prior Knowledge
Jinyuan Jia, Neil Zhenqiang Gong

TL;DR
This paper introduces Calibrate, a method that enhances local differential privacy algorithms for frequency estimation and heavy hitter detection by incorporating prior knowledge, leading to significantly improved accuracy in real-world datasets.
Contribution
Calibrate is a novel approach that leverages prior knowledge through statistical inference to improve LDP-based frequency estimation and heavy hitter identification.
Findings
Calibrate outperforms existing LDP algorithms in accuracy.
It effectively incorporates prior knowledge to reduce estimation errors.
Empirical results demonstrate significant improvements on real datasets.
Abstract
Estimating frequencies of certain items among a population is a basic step in data analytics, which enables more advanced data analytics (e.g., heavy hitter identification, frequent pattern mining), client software optimization, and detecting unwanted or malicious hijacking of user settings in browsers. Frequency estimation and heavy hitter identification with local differential privacy (LDP) protect user privacy as well as the data collector. Existing LDP algorithms cannot leverage 1) prior knowledge about the noise in the estimated item frequencies and 2) prior knowledge about the true item frequencies. As a result, they achieve suboptimal performance in practice. In this work, we aim to design LDP algorithms that can leverage such prior knowledge. Specifically, we design to incorporate the prior knowledge via statistical inference. can be appended to an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Human Mobility and Location-Based Analysis
