MedForge: Building Medical Foundation Models Like Open Source Software   Development

Zheling Tan; Kexin Ding; Jin Gao; Mu Zhou; Dimitris Metaxas; Shaoting; Zhang; Dequan Wang

arXiv:2502.16055·cs.LG·February 27, 2025

MedForge: Building Medical Foundation Models Like Open Source Software Development

Zheling Tan, Kexin Ding, Jin Gao, Mu Zhou, Dimitris Metaxas, Shaoting, Zhang, Dequan Wang

PDF

Open Access 1 Repo

TL;DR

MedForge introduces a community-driven framework for developing medical foundation models that preserves patient privacy, enables task-specific adaptation, and improves multi-institutional clinical performance.

Contribution

It proposes a novel cooperative framework using LoRA modules for scalable, privacy-preserving medical model development across institutions.

Findings

01

Strong performance on multiple clinical datasets

02

Effective multi-center clinical collaboration

03

Preserves raw patient data privacy

Abstract

Foundational models (FMs) have made significant strides in the healthcare domain. Yet the data silo challenge and privacy concern remain in healthcare systems, hindering safe medical data sharing and collaborative model development among institutions. The collection and curation of scalable clinical datasets increasingly become the bottleneck for training strong FMs. In this study, we propose Medical Foundation Models Merging (MedForge), a cooperative framework enabling a community-driven medical foundation model development, meanwhile preventing the information leakage of raw patient data and mitigating synchronization model development issues across clinical institutions. MedForge offers a bottom-up model construction mechanism by flexibly merging task-specific Low-Rank Adaptation (LoRA) modules, which can adapt to downstream tasks while retaining original model parameters. Through an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tanzheling/medforge
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Privacy-Preserving Technologies in Data · Advanced Graph Neural Networks