Privacy-Preserving Randomized Controlled Trials: A Protocol for Industry Scale Deployment
Mahnush Movahedi, Benjamin M. Case, Andrew Knox, James Honaker, Li Li,, Yiming Paul Li, Sanjay Saravanan, Shubho Sengupta, Erik Taubeneck

TL;DR
This paper presents a scalable, privacy-preserving protocol for conducting large-scale randomized controlled trials across multiple parties, ensuring data privacy and integrity while analyzing hundreds of millions of data points.
Contribution
It introduces a three-stage protocol combining secret sharing, MPC, and differential privacy for industry-scale RCTs, enabling privacy-preserving analysis of massive datasets.
Findings
Successfully deployed on over 500 million data rows
Achieved formal privacy guarantees with differential privacy
Demonstrated practical implementation in an ads effectiveness product
Abstract
In this paper, we outline a way to deploy a privacy-preserving protocol for multiparty Randomized Controlled Trials on the scale of 500 million rows of data and more than a billion gates. Randomized Controlled Trials (RCTs) are widely used to improve business and policy decisions in various sectors such as healthcare, education, criminology, and marketing. A Randomized Controlled Trial is a scientifically rigorous method to measure the effectiveness of a treatment. This is accomplished by randomly allocating subjects to two or more groups, treating them differently, and then comparing the outcomes across groups. In many scenarios, multiple parties hold different parts of the data for conducting and analyzing RCTs. Given privacy requirements and expectations of each of these parties, it is often challenging to have a centralized store of data to conduct and analyze RCTs. We accomplish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Ethics in Clinical Research
