Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification

Xinyuan Zhao; Yihang Wu; Ahmad Chaddad; Tareef Daqqaq; Reem Kateb

arXiv:2602.01633·cs.CV·February 3, 2026

TL;DR

This paper introduces a federated learning framework that combines a dynamic adaptive focal loss with client-aware aggregation to improve medical image classification across heterogeneous, class-imbalanced client datasets, reporting accuracy improvements of up to 41.69% on three public datasets.

Contribution

The paper proposes a federated learning approach that dynamically adjusts for class imbalance and client heterogeneity in medical image classification.

Findings

01 Outperforms existing models on three public datasets.

02 Achieves accuracy improvements of up to 41.69%.

03 Validates the effectiveness of the approach through ablation studies.

Abstract

While deep learning models like Vision Transformer (ViT) have achieved significant advances, they typically require large datasets. Because of data privacy regulations, access to many original datasets, especially medical images, is restricted. Federated learning (FL) addresses this challenge by enabling global model aggregation without data exchange. However, the data heterogeneity and class imbalance that exist across local clients pose challenges for the generalization of the model. This study proposes an FL framework leveraging a dynamic adaptive focal loss (DAFL) and a client-aware aggregation strategy for local training. Specifically, we design a dynamic class imbalance coefficient that adjusts based on each client's sample distribution and class data distribution, ensuring minority classes receive sufficient attention and preventing sparse data from being ignored. To address…
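To make the idea of a class-imbalance coefficient inside a focal loss more concrete, here is a minimal sketch in plain Python. It is not the paper's DAFL formulation, which the abstract does not spell out. The function names (`class_weights`, `focal_loss`), the smoothed inverse-frequency weighting, the `eps` smoothing term, and the default `gamma=2.0` are all assumptions chosen for illustration. The sketch only shows the general pattern: a per-class weight derived from a client's local class counts scales the standard focal term, so rare classes and hard examples contribute more to the loss.

```python
import math


def class_weights(class_counts, eps=1.0):
    """Hypothetical imbalance coefficient (an assumption, not the paper's formula):
    smoothed inverse class frequency, normalized so the weights average to 1."""
    raw = [1.0 / (n + eps) for n in class_counts]  # eps avoids division by zero
    mean = sum(raw) / len(raw)
    return [r / mean for r in raw]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, label, weights, gamma=2.0):
    """Class-weighted focal loss for one sample:
    -w[label] * (1 - p_t)^gamma * log(p_t), where p_t is the true-class probability."""
    p_t = max(softmax(logits)[label], 1e-12)  # clamp to keep log finite
    return -weights[label] * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    # Illustrative local class counts for one client (made-up numbers).
    counts = [900, 90, 10]
    w = class_weights(counts)
    # The same logits are penalized more when the true class is rare on this client.
    print(focal_loss([1.0, 1.0, 1.0], 0, w))
    print(focal_loss([1.0, 1.0, 1.0], 2, w))
```

With `gamma=0` and uniform weights this reduces to ordinary cross-entropy, which is a convenient sanity check. In a federated setting, each client would recompute the weights from its own label counts before local training; how the paper updates its coefficient dynamically is not described in the text available here.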

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Retinal Imaging and Analysis · COVID-19 diagnosis using AI