Frequency Estimation of Correlated Multi-attribute Data under Local Differential Privacy
Shafizur Rahman Seeam, Ye Zheng, Yidan Hu

TL;DR
This paper presents a novel two-phase local differential privacy framework that leverages inter-attribute correlations to improve frequency estimation accuracy in high-dimensional, multi-attribute datasets.
Contribution
It introduces a two-phase LDP method that privately learns and exploits attribute dependencies, significantly reducing noise and utility loss in correlated data.
Findings
Corr-RR outperforms existing methods in accuracy
Significant improvements in high-dimensional datasets
Effective in strongly correlated attribute scenarios
Abstract
Large-scale data collection, from national censuses to IoT-enabled smart homes, routinely gathers dozens of attributes per individual. These multi-attribute datasets are crucial for analytics but pose significant privacy risks. Local Differential Privacy (LDP) is a powerful tool for protecting user privacy by allowing users to locally perturb their records before releasing them to an untrusted data aggregator. However, existing LDP mechanisms either split the privacy budget across all attributes or treat each attribute independently, thereby ignoring natural inter-attribute correlations. This leads to excessive noise and, consequently, significant utility loss, particularly in high-dimensional datasets. We introduce a two-phase LDP framework that overcomes these limitations by privately learning and exploiting inter-attribute dependencies. In Phase~I, a small subset of users applies a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Privacy-Preserving Technologies in Data · Probability and Risk Models
