Improving the Variance of Differentially Private Randomized Experiments through Clustering
Adel Javanmard, Vahab Mirrokni, Jean Pouget-Abadie

TL;DR
This paper introduces Cluster-DP, a new differentially private mechanism that uses data clustering to reduce variance in causal effect estimation, balancing privacy and accuracy more effectively.
Contribution
The paper proposes Cluster-DP, a novel privacy mechanism leveraging data clustering to improve the privacy-variance trade-off in causal analysis.
Findings
Cluster-DP reduces variance compared to baseline methods.
Higher-quality clusters lead to better privacy-variance trade-offs.
Theoretical and empirical evaluations confirm the effectiveness of Cluster-DP.
Abstract
Estimating causal effects from randomized experiments is only possible if participants are willing to disclose their potentially sensitive responses. Differential privacy, a widely used framework for ensuring an algorithms privacy guarantees, can encourage participants to share their responses without the risk of de-anonymization. However, many mechanisms achieve differential privacy by adding noise to the original dataset, which reduces the precision of causal effect estimation. This introduces a fundamental trade-off between privacy and variance when performing causal analyses on differentially private data. In this work, we propose a new differentially private mechanism, "Cluster-DP", which leverages a given cluster structure in the data to improve the privacy-variance trade-off. While our results apply to any clustering, we demonstrate that selecting higher-quality clusters,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Bayesian Inference · Privacy-Preserving Technologies in Data
