Assessing Utility of Differential Privacy for RCTs
Kaitlyn R. Webb, Soumya Mukherjee, Aratrika Mustafi, Aleksandra Slavkovi\'c, Lars Vilhuber

TL;DR
This paper evaluates how differential privacy techniques affect the analysis of randomized controlled trial data, demonstrating that privacy-preserving methods can maintain valid statistical inference in published research.
Contribution
It empirically assesses the impact of differential privacy algorithms on RCT analysis, highlighting necessary adjustments and demonstrating practical applicability.
Findings
Privacy-preserving methods can support valid statistical inference in RCTs.
Adjustments are needed for stability-based histogram algorithms.
Simulation studies confirm the feasibility of privacy-protected analysis.
Abstract
Randomized controlled trials (RCTs) have become powerful tools for assessing the impact of interventions and policies in many contexts. They are considered the gold standard for causal inference in the biomedical fields and many social sciences. Researchers have published an increasing number of studies that rely on RCTs for at least part of their inference. These studies typically include the response data that has been collected, de-identified, and sometimes protected through traditional disclosure limitation methods. In this paper, we empirically assess the impact of privacy-preserving synthetic data generation methodologies on published RCT analyses by leveraging available replication packages (research compendia) in economics and policy analysis. We implement three privacy-preserving algorithms, that use as a base one of the basic differentially private (DP) algorithms, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Health Systems, Economic Evaluations, Quality of Life · Statistical Methods and Bayesian Inference
