A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Anran Li; Yuanyuan Chen; Wenjun Long; Yu Yin; Yan Hu; Hyunjae Kim; Weipeng Zhou; Yujia Zhou; Hongyi Peng; Yang Ren; Xuguang Ai; Zhenyue Qin; Ming Hu; Xiaoxiao Li; Han Yu; Yih-Chung Tham; Lucila Ohno-Machado; Hua Xu; Qingyu Chen

arXiv:2601.22124·cs.CL·January 30, 2026

A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Anran Li, Yuanyuan Chen, Wenjun Long, Yu Yin, Yan Hu, Hyunjae Kim, Weipeng Zhou, Yujia Zhou, Hongyi Peng, Yang Ren, Xuguang Ai, Zhenyue Qin, Ming Hu, Xiaoxiao Li, Han Yu, Yih-Chung Tham, Lucila Ohno-Machado, Hua Xu, Qingyu Chen

PDF

Open Access

TL;DR

This paper presents a federated, parameter-efficient learning framework for large language models in medicine, enabling collaborative, privacy-preserving model training across healthcare institutions with heterogeneous data.

Contribution

It introduces Fed-MedLoRA, a low-rank adapter-based federated learning method that reduces communication overhead and improves model convergence in heterogeneous clinical data settings.

Findings

01

Fed-MedLoRA reduces communication costs significantly.

02

The framework achieves comparable accuracy to centralized models.

03

It demonstrates effective adaptation to new clinical sites with limited data.

Abstract

Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clinical settings, LLMs are typically further adapted through continued pretraining or post-training using clinical data. However, most medical LLMs are trained on data from a single institution, which faces limitations in generalizability and safety in heterogeneous systems. Federated learning (FL) is a promising solution for enabling collaborative model development across healthcare institutions. Yet applying FL to LLMs in medicine remains fundamentally limited. First, conventional FL requires transmitting the full model during each communication round, which becomes impractical for multi-billion-parameter LLMs given the limited computational resources. Second, many FL algorithms implicitly assume data homogeneity, whereas…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Privacy-Preserving Technologies in Data · Artificial Intelligence in Healthcare and Education