TL;DR
This paper introduces a self-supervised federated learning framework using Transformer-based masked image modeling, significantly improving model robustness and accuracy in heterogeneous medical imaging datasets without extra pre-training data.
Contribution
It presents a novel Transformer-based self-supervised pre-training paradigm for federated learning on medical images, enhancing robustness and performance under data heterogeneity.
Findings
Improves test accuracy by up to 5.06% on retinal data
Enhances model robustness against data heterogeneity
Models generalize better to out-of-distribution data
Abstract
The collection and curation of large-scale medical datasets from multiple institutions is essential for training accurate deep learning models, but privacy concerns often hinder data sharing. Federated learning (FL) is a promising solution that enables privacy-preserving collaborative learning among different institutions, but it generally suffers from performance deterioration due to heterogeneous data distributions and a lack of quality labeled data. In this paper, we present a robust and label-efficient self-supervised FL framework for medical image analysis. Our method introduces a novel Transformer-based self-supervised pre-training paradigm that pre-trains models directly on decentralized target task datasets using masked image modeling, to facilitate more robust representation learning on heterogeneous data and effective knowledge transfer to downstream models. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
