MedForge: Building Medical Foundation Models Like Open Source Software Development
Zheling Tan, Kexin Ding, Jin Gao, Mu Zhou, Dimitris Metaxas, Shaoting, Zhang, Dequan Wang

TL;DR
MedForge introduces a community-driven framework for developing medical foundation models that preserves patient privacy, enables task-specific adaptation, and improves multi-institutional clinical performance.
Contribution
It proposes a novel cooperative framework using LoRA modules for scalable, privacy-preserving medical model development across institutions.
Findings
Strong performance on multiple clinical datasets
Effective multi-center clinical collaboration
Preserves raw patient data privacy
Abstract
Foundational models (FMs) have made significant strides in the healthcare domain. Yet the data silo challenge and privacy concern remain in healthcare systems, hindering safe medical data sharing and collaborative model development among institutions. The collection and curation of scalable clinical datasets increasingly become the bottleneck for training strong FMs. In this study, we propose Medical Foundation Models Merging (MedForge), a cooperative framework enabling a community-driven medical foundation model development, meanwhile preventing the information leakage of raw patient data and mitigating synchronization model development issues across clinical institutions. MedForge offers a bottom-up model construction mechanism by flexibly merging task-specific Low-Rank Adaptation (LoRA) modules, which can adapt to downstream tasks while retaining original model parameters. Through an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Privacy-Preserving Technologies in Data · Advanced Graph Neural Networks
