Label-Efficient Self-Supervised Federated Learning for Tackling Data   Heterogeneity in Medical Imaging

Rui Yan; Liangqiong Qu; Qingyue Wei; Shih-Cheng Huang; Liyue Shen,; Daniel Rubin; Lei Xing; Yuyin Zhou

arXiv:2205.08576·cs.CV·January 12, 2023

Label-Efficient Self-Supervised Federated Learning for Tackling Data Heterogeneity in Medical Imaging

Rui Yan, Liangqiong Qu, Qingyue Wei, Shih-Cheng Huang, Liyue Shen,, Daniel Rubin, Lei Xing, Yuyin Zhou

PDF

1 Repo

TL;DR

This paper introduces a self-supervised federated learning framework using Transformer-based masked image modeling, significantly improving model robustness and accuracy in heterogeneous medical imaging datasets without extra pre-training data.

Contribution

It presents a novel Transformer-based self-supervised pre-training paradigm for federated learning on medical images, enhancing robustness and performance under data heterogeneity.

Findings

01

Improves test accuracy by up to 5.06% on retinal data

02

Enhances model robustness against data heterogeneity

03

Models generalize better to out-of-distribution data

Abstract

The collection and curation of large-scale medical datasets from multiple institutions is essential for training accurate deep learning models, but privacy concerns often hinder data sharing. Federated learning (FL) is a promising solution that enables privacy-preserving collaborative learning among different institutions, but it generally suffers from performance deterioration due to heterogeneous data distributions and a lack of quality labeled data. In this paper, we present a robust and label-efficient self-supervised FL framework for medical image analysis. Our method introduces a novel Transformer-based self-supervised pre-training paradigm that pre-trains models directly on decentralized target task datasets using masked image modeling, to facilitate more robust representation learning on heterogeneous data and effective knowledge transfer to downstream models. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rui-yan/ssl-fl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.