SelfFed: Self-Supervised Federated Learning for Data Heterogeneity and Label Scarcity in Medical Images
Sunder Ali Khowaja, Kapal Dev, Syed Muhammad Anwar, Marius George, Linguraru

TL;DR
SelfFed is a novel federated learning framework that effectively addresses data heterogeneity and label scarcity in medical imaging by combining pre-training, contrastive learning, and a new aggregation strategy, leading to improved performance on challenging datasets.
Contribution
The paper introduces SelfFed, a self-supervised federated learning approach that tackles data heterogeneity and label scarcity in medical images through a two-phase process with innovative strategies.
Findings
Achieves up to 8.8% and 4.1% improvements on Retina and COVID-FL datasets.
Performs well even with only 10% labeled data.
Outperforms existing federated learning baselines.
Abstract
Self-supervised learning in the federated learning paradigm has been gaining a lot of interest both in industry and research due to the collaborative learning capability on unlabeled yet isolated data. However, self-supervised based federated learning strategies suffer from performance degradation due to label scarcity and diverse data distributions, i.e., data heterogeneity. In this paper, we propose the SelfFed framework for medical images to overcome data heterogeneity and label scarcity issues. The first phase of the SelfFed framework helps to overcome the data heterogeneity issue by leveraging the pre-training paradigm that performs augmentative modeling using Swin Transformer-based encoder in a decentralized manner. The label scarcity issue is addressed by fine-tuning paradigm that introduces a contrastive network and a novel aggregation strategy. We perform our experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · AI in cancer detection · Advanced Technologies in Various Fields
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Label Smoothing · Adam · Stochastic Depth · Position-Wise Feed-Forward Layer
