Randomize the Future: Asymptotically Optimal Locally Private Frequency Estimation Protocol for Longitudinal Data
Olga Ohrimenko, Anthony Wirth, Hao Wu

TL;DR
This paper introduces a new locally private frequency estimation protocol for longitudinal data that significantly reduces error dependence on user data changes, achieving near-optimal accuracy with an online approach.
Contribution
It presents a novel randomizer, FutureRand, that correlates noise and pre-computes results, breaking the linear error dependence on data change frequency in LDP longitudinal data estimation.
Findings
Error bound scales polylogarithmically with data periods
Protocol matches lower bounds up to logarithmic factors
Achieves online frequency estimation with improved accuracy
Abstract
Longitudinal data tracking under Local Differential Privacy (LDP) is a challenging task. Baseline solutions that repeatedly invoke a protocol designed for one-time computation lead to linear decay in the privacy or utility guarantee with respect to the number of computations. To avoid this, the recent approach of Erlingsson et al. (2020) exploits the potential sparsity of user data that changes only infrequently. Their protocol targets the fundamental problem of frequency estimation protocol for longitudinal binary data, with error of , where is the privacy budget, is the number of time periods, is the maximum number of changes of user data, and is the failure probability. Notably, the error bound scales polylogarithmically with , but linearly with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Probability and Risk Models · Medical Imaging and Pathology Studies
