Re-identification of Individuals in Genomic Datasets Using Public Face Images
Rajagopal Venkatesaramani, Bradley A. Malin, Yevgeniy Vorobeychik

TL;DR
This paper assesses the real-world risk of re-identifying individuals in genomic datasets using public face images, finding the actual threat is lower than previously thought and proposing noise addition as a mitigation strategy.
Contribution
It provides a systematic analysis of re-identification risks with real images and introduces a method to reduce this risk through imperceptible noise addition.
Findings
Re-identification success is lower with real images than prior studies suggested.
Adding small, carefully crafted noise reduces re-identification risk significantly.
The proposed noise method maintains image quality while protecting privacy.
Abstract
DNA sequencing is becoming increasingly commonplace, both in medical and direct-to-consumer settings. To promote discovery, collected genomic data is often de-identified and shared, either in public repositories, such as OpenSNP, or with researchers through access-controlled repositories. However, recent studies have suggested that genomic data can be effectively matched to high-resolution three-dimensional face images, which raises a concern that the increasingly ubiquitous public face images can be linked to shared genomic data, thereby re-identifying individuals in the genomic data. While these investigations illustrate the possibility of such an attack, they assume that those performing the linkage have access to extremely well-curated data. Given that this is unlikely to be the case in practice, it calls into question the pragmatic nature of the attack. As such, we systematically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Genetic and rare skin diseases.
