Federated Learning of Medical Concepts Embedding using BEHRT
Ofir Ben Shoham, Nadav Rappoport

TL;DR
This paper introduces a federated learning approach for training medical concepts embeddings using BEHRT on EHR data, achieving performance close to centralized models and enhancing prediction accuracy while preserving data privacy.
Contribution
It presents a novel federated learning method for pre-training BEHRT-based embeddings on distributed EHR data, enabling privacy-preserving, high-performance medical prediction models.
Findings
Federated learning achieves near-centralized performance.
Pre-trained MLM improves next visit prediction accuracy.
Model outperforms local models in average precision.
Abstract
Electronic Health Records (EHR) data contains medical records such as diagnoses, medications, procedures, and treatments of patients. This data is often considered sensitive medical information. Therefore, the EHR data from the medical centers often cannot be shared, making it difficult to create prediction models using multi-center EHR data, which is essential for such models' robustness and generalizability. Federated Learning (FL) is an algorithmic approach that allows learning a shared model using data in multiple locations without the need to store all data in a central place. An example of a prediction model's task is to predict future diseases. More specifically, the model needs to predict patient's next visit diagnoses, based on current and previous clinical data. Such a prediction model can support care providers in making clinical decisions and even provide preventive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
