Securing Genomic Data Against Inference Attacks in Federated Learning Environments
Chetan Pathade, Shubham Patil

TL;DR
This paper evaluates the vulnerability of federated learning in genomic data sharing to inference attacks, demonstrating significant privacy risks and highlighting the need for enhanced protective measures.
Contribution
It provides a comprehensive simulation of inference attacks on federated genomic data, revealing vulnerabilities and guiding future privacy-preserving solutions.
Findings
Gradient-Based MIA achieves 0.79 precision and 0.87 F1-score
Gradient exposure poses significant privacy risks
Naive federated learning is inadequate for genomic privacy
Abstract
Federated Learning (FL) offers a promising framework for collaboratively training machine learning models across decentralized genomic datasets without direct data sharing. While this approach preserves data locality, it remains susceptible to sophisticated inference attacks that can compromise individual privacy. In this study, we simulate a federated learning setup using synthetic genomic data and assess its vulnerability to three key attack vectors: Membership Inference Attack (MIA), Gradient-Based Membership Inference Attack, and Label Inference Attack (LIA). Our experiments reveal that Gradient-Based MIA achieves the highest effectiveness, with a precision of 0.79 and F1-score of 0.87, underscoring the risk posed by gradient exposure in federated updates. Additionally, we visualize comparative attack performance through radar plots and quantify model leakage across clients. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
