Data Privacy and Specimen Pooling: Using an old tool for New Challenges
Paramita Saha-Chaudhuri, Clarice Weinberg

TL;DR
This paper demonstrates that specimen pooling, an epidemiologic tool, can be adapted as a privacy-preserving method to analyze confidential data in case-control studies, maintaining data utility while protecting individual privacy.
Contribution
The study introduces a novel application of specimen pooling for privacy-preserving data analysis in epidemiology, enabling estimation of odds ratios from aggregate data.
Findings
Parameter estimates from pooled data closely match individual data estimates.
Pooling maintains similar standard errors and confidence intervals.
Method is effective for both matched and unmatched case-control designs.
Abstract
Background: In the context of ongoing debate over data confidentiality versus shared use of research data, as raised following the new EU General Data Protection Regulation, we seek to find alternate techniques that can balance these two issues. In particular, we demonstrate that an existing epidemiologic tool, specimen pooling, can be adapted as a privacy-preserving method to enable data analysis while maintaining data confidentiality. Specimen pooling is a cost-effective tool in studying the effect of an expensive-to-measure exposure on a disease outcome, for both unmatched and matched case-control designs. We propose the technique in a new context to analyze confidential data and demonstrate that it can be successfully used to estimate OR of covariates based on aggregate data when individual patient data cannot be shared. Methods: We demonstrate the application of specimen pooling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Ethics in Clinical Research · Privacy, Security, and Data Protection
