Exploratory Analysis of Federated Learning Methods with Differential   Privacy on MIMIC-III

Aron N. Horvath; Matteo Berchier; Farhad Nooralahzadeh; Ahmed Allam,; Michael Krauthammer

arXiv:2302.04208·cs.LG·February 9, 2023·1 cites

Exploratory Analysis of Federated Learning Methods with Differential Privacy on MIMIC-III

Aron N. Horvath, Matteo Berchier, Farhad Nooralahzadeh, Ahmed Allam,, Michael Krauthammer

PDF

Open Access

TL;DR

This paper evaluates federated learning with differential privacy on healthcare data, analyzing how data distribution, communication, and privacy techniques affect model performance and privacy leakage.

Contribution

It provides an extensive empirical comparison of federation and differential privacy methods on MIMIC-III, offering guidance on optimal parameter choices.

Findings

01

FedProx outperforms FedAvg in imbalanced data scenarios.

02

Both DP techniques achieve performance close to non-private models.

03

Significant privacy leakage remains despite differential privacy measures.

Abstract

Background: Federated learning methods offer the possibility of training machine learning models on privacy-sensitive data sets, which cannot be easily shared. Multiple regulations pose strict requirements on the storage and usage of healthcare data, leading to data being in silos (i.e. locked-in at healthcare facilities). The application of federated algorithms on these datasets could accelerate disease diagnostic, drug development, as well as improve patient care. Methods: We present an extensive evaluation of the impact of different federation and differential privacy techniques when training models on the open-source MIMIC-III dataset. We analyze a set of parameters influencing a federated model performance, namely data distribution (homogeneous and heterogeneous), communication strategies (communication rounds vs. local training epochs), federation strategies (FedAvg vs.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data