SemiFed: Semi-supervised Federated Learning with Consistency and   Pseudo-Labeling

Haowen Lin; Jian Lou; Li Xiong; Cyrus Shahabi

arXiv:2108.09412·cs.LG·August 24, 2021·23 cites

SemiFed: Semi-supervised Federated Learning with Consistency and Pseudo-Labeling

Haowen Lin, Jian Lou, Li Xiong, Cyrus Shahabi

PDF

Open Access

TL;DR

SemiFed introduces a federated learning framework that leverages semi-supervised techniques like consistency regularization and pseudo-labeling to improve model accuracy with partially labeled data across clients.

Contribution

It unifies consistency regularization and pseudo-labeling in federated learning, addressing the challenge of limited labeled data in cross-silo settings.

Findings

01

SemiFed outperforms baseline methods on image benchmarks.

02

Effective in both homogeneous and heterogeneous data distributions.

03

Utilizes model agreement for high-confidence pseudo-labels.

Abstract

Federated learning enables multiple clients, such as mobile phones and organizations, to collaboratively learn a shared model for prediction while protecting local data privacy. However, most recent research and applications of federated learning assume that all clients have fully labeled data, which is impractical in real-world settings. In this work, we focus on a new scenario for cross-silo federated learning, where data samples of each client are partially labeled. We borrow ideas from semi-supervised learning methods where a large amount of unlabeled data is utilized to improve the model's accuracy despite limited access to labeled examples. We propose a new framework dubbed SemiFed that unifies two dominant approaches for semi-supervised learning: consistency regularization and pseudo-labeling. SemiFed first applies advanced data augmentation techniques to enforce consistency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI