Federated Learning for Heterogeneous Electronic Health Record Systems with Cost Effective Participant Selection
Jiyoun Kim, Junu Kim, Kyunghoon Hur, Edward Choi

TL;DR
This paper introduces EHRFL, a federated learning framework that enhances compatibility across heterogeneous EHR systems and reduces costs through a privacy-preserving participant selection strategy, enabling efficient, institution-specific clinical prediction models.
Contribution
The paper presents a novel federated learning approach combining text-based EHR modeling and a differentially private participant selection method for cost-effective, compatible, and privacy-preserving healthcare modeling.
Findings
Effective cross-institution compatibility without costly standardization
Cost reduction in federated learning via participant selection
Maintained model performance with fewer participants
Abstract
The increasing volume of electronic health records (EHRs) presents the opportunity to improve the accuracy and robustness of models in clinical prediction tasks. Unlike traditional centralized approaches, federated learning enables training on data from multiple institutions while preserving patient privacy and complying with regulatory constraints. In practice, healthcare institutions (i.e., hosts) often need to build predictive models tailored to their specific needs (e.g., creatinine-level prediction, N-day readmission prediction) using federated learning. When building a federated learning model for a single healthcare institution, two key challenges arise: (1) ensuring compatibility across heterogeneous EHR systems, and (2) managing federated learning costs within budget constraints. Specifically, heterogeneity in EHR systems across institutions hinders compatible modeling, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
MethodsFocus
