Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings

Binbin Xu; G\'erard Dray

arXiv:2507.03122·cs.IR·May 20, 2026

Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings

Binbin Xu, G\'erard Dray

PDF

TL;DR

This paper explores federated learning with lightweight models and pretrained embeddings for ICD code classification from clinical notes, demonstrating comparable performance to centralized methods while preserving privacy.

Contribution

It introduces a scalable, privacy-preserving pipeline combining frozen embeddings with simple MLP classifiers for federated clinical NLP tasks, contrasting with large language models.

Findings

01

Embedding quality impacts performance more than classifier complexity.

02

Federated learning closely matches centralized results in ideal conditions.

03

Lightweight models achieve competitive F1 scores with significantly reduced size.

Abstract

This study investigates the feasibility and performance of federated learning (FL) for multi-label ICD code classification using clinical notes from the MIMIC-IV dataset. Unlike previous approaches that rely on centralized training or fine-tuned large language models, we propose a lightweight and scalable pipeline combining frozen text embeddings with simple multilayer perceptron (MLP) classifiers. This design offers a privacy-preserving and deployment-efficient alternative for clinical NLP applications, particularly suited to distributed healthcare settings. Extensive experiments across both centralized and federated configurations were conducted, testing six publicly available embedding models from Massive Text Embedding Benchmark leaderboard and three MLP classifier architectures under two medical coding (ICD-9 and ICD-10). Additionally, ablation studies over ten random stratified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.