Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification
Xinyuan Zhao, Yihang Wu, Ahmad Chaddad, Tareef Daqqaq, Reem Kateb

TL;DR
This paper introduces a federated learning framework with adaptive focal loss and client-aware aggregation to improve medical image classification across heterogeneous and imbalanced datasets, achieving significant accuracy gains.
Contribution
It proposes a novel federated learning approach that dynamically adjusts for class imbalance and client heterogeneity in medical image classification tasks.
Findings
Outperforms existing models on three public datasets.
Achieves accuracy improvements up to 41.69%.
Validates effectiveness through ablation studies.
Abstract
While deep learning models like Vision Transformer (ViT) have achieved significant advances, they typically require large datasets. With data privacy regulations, access to many original datasets is restricted, especially medical images. Federated learning (FL) addresses this challenge by enabling global model aggregation without data exchange. However, the heterogeneity of the data and the class imbalance that exist in local clients pose challenges for the generalization of the model. This study proposes a FL framework leveraging a dynamic adaptive focal loss (DAFL) and a client-aware aggregation strategy for local training. Specifically, we design a dynamic class imbalance coefficient that adjusts based on each client's sample distribution and class data distribution, ensuring minority classes receive sufficient attention and preventing sparse data from being ignored. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Retinal Imaging and Analysis · COVID-19 diagnosis using AI
