GOPHER: Optimization-based Phenotype Randomization for Genome-Wide Association Studies with Differential Privacy
Anupama Nandi, Seth Neel, and Hyunghoon Cho

TL;DR
This paper introduces optimization-based differential privacy mechanisms for GWAS that improve the accuracy of released genetic association statistics while ensuring participant privacy, enabling comprehensive and private data sharing.
Contribution
It proposes novel DP techniques using optimization and personalized priors to enhance utility in GWAS data release, addressing limitations of previous methods.
Findings
High accuracy in privacy-preserving GWAS statistics on UK Biobank data
Effective reduction of noise through optimization-based randomization
Sample-specific privacy optimization improves utility
Abstract
Genome-wide association studies (GWAS) are an essential tool in biomedical research for identifying genetic factors linked to health and disease. However, publicly releasing GWAS summary statistics poses well-recognized privacy risks, including the potential to infer an individual's participation in the study or to reveal sensitive phenotypic information (e.g., disease status). While differential privacy (DP) offers a rigorous mathematical framework for mitigating these risks, existing DP techniques for GWAS either introduce excessive noise or restrict the release to a limited set of results. In this work, we present practical DP mechanisms for releasing the complete set of genome-wide association statistics with privacy guarantees. We demonstrate the accuracy of the privacy-preserving statistics released by our mechanisms on a range of GWAS datasets from the UK Biobank, utilizing both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Genetic Associations and Epidemiology · Advanced Causal Inference Techniques
