Improving Utility for Privacy-Preserving Analysis of Correlated Columns using Pufferfish Privacy
Krystal Maughan, Joseph P. Near

TL;DR
This paper introduces the Tabular DDP Mechanism, a new privacy-preserving method for high-dimensional survey data that improves accuracy by accounting for data correlations using dependent differential privacy.
Contribution
The paper presents a novel privacy mechanism that calibrates noise based on data correlation, enhancing accuracy in high-dimensional, correlated survey data analysis.
Findings
Significantly improves accuracy over Laplace mechanism
Effectively handles incomplete correlation in high-dimensional data
Empirical results demonstrate practical utility of the mechanism
Abstract
Surveys are an important tool for many areas of social science research, but privacy concerns can complicate the collection and analysis of survey data. Differentially private analyses of survey data can address these concerns, but at the cost of accuracy - especially for high-dimensional statistics. We present a novel privacy mechanism, the Tabular DDP Mechanism, designed for high-dimensional statistics with incomplete correlation. The Tabular DDP Mechanism satisfies dependent differential privacy, a variant of Pufferfish privacy; it works by building a causal model of the sensitive data, then calibrating noise to the level of correlation between statistics. An empirical evaluation on survey data shows that the Tabular DDP Mechanism can significantly improve accuracy over the Laplace mechanism.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Methodology and Nonresponse · Privacy-Preserving Technologies in Data · Census and Population Estimation
