Genomic Data Sharing under Dependent Local Differential Privacy
Emre Yilmaz, Tianxi Ji, Erman Ayday, Pan Li

TL;DR
This paper introduces a new privacy mechanism for sharing correlated genomic data under dependent local differential privacy, improving data utility and protecting against inference attacks, with evaluation on real genomic data.
Contribution
It proposes a novel dependent local differential privacy mechanism tailored for genomic data, considering data correlations and family privacy, with a greedy algorithm for utility maximization.
Findings
Outperforms traditional randomized response in utility.
Effectively prevents inference of correlated genomic data.
Balances privacy and utility considering family privacy preferences.
Abstract
Privacy-preserving genomic data sharing is prominent to increase the pace of genomic research, and hence to pave the way towards personalized genomic medicine. In this paper, we introduce ()-dependent local differential privacy (LDP) for privacy-preserving sharing of correlated data and propose a genomic data sharing mechanism under this privacy definition. We first show that the original definition of LDP is not suitable for genomic data sharing, and then we propose a new mechanism to share genomic data. The proposed mechanism considers the correlations in data during data sharing, eliminates statistically unlikely data values beforehand, and adjusts the probability distributions for each shared data point accordingly. By doing so, we show that we can avoid an attacker from inferring the correct values of the shared data points by utilizing the correlations in the data.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Reproductive Health and Technologies
