Exploratory Analysis of Federated Learning Methods with Differential Privacy on MIMIC-III
Aron N. Horvath, Matteo Berchier, Farhad Nooralahzadeh, Ahmed Allam,, Michael Krauthammer

TL;DR
This paper evaluates federated learning with differential privacy on healthcare data, analyzing how data distribution, communication, and privacy techniques affect model performance and privacy leakage.
Contribution
It provides an extensive empirical comparison of federation and differential privacy methods on MIMIC-III, offering guidance on optimal parameter choices.
Findings
FedProx outperforms FedAvg in imbalanced data scenarios.
Both DP techniques achieve performance close to non-private models.
Significant privacy leakage remains despite differential privacy measures.
Abstract
Background: Federated learning methods offer the possibility of training machine learning models on privacy-sensitive data sets, which cannot be easily shared. Multiple regulations pose strict requirements on the storage and usage of healthcare data, leading to data being in silos (i.e. locked-in at healthcare facilities). The application of federated algorithms on these datasets could accelerate disease diagnostic, drug development, as well as improve patient care. Methods: We present an extensive evaluation of the impact of different federation and differential privacy techniques when training models on the open-source MIMIC-III dataset. We analyze a set of parameters influencing a federated model performance, namely data distribution (homogeneous and heterogeneous), communication strategies (communication rounds vs. local training epochs), federation strategies (FedAvg vs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
