SemiFed: Semi-supervised Federated Learning with Consistency and Pseudo-Labeling
Haowen Lin, Jian Lou, Li Xiong, Cyrus Shahabi

TL;DR
SemiFed introduces a federated learning framework that leverages semi-supervised techniques like consistency regularization and pseudo-labeling to improve model accuracy with partially labeled data across clients.
Contribution
It unifies consistency regularization and pseudo-labeling in federated learning, addressing the challenge of limited labeled data in cross-silo settings.
Findings
SemiFed outperforms baseline methods on image benchmarks.
Effective in both homogeneous and heterogeneous data distributions.
Utilizes model agreement for high-confidence pseudo-labels.
Abstract
Federated learning enables multiple clients, such as mobile phones and organizations, to collaboratively learn a shared model for prediction while protecting local data privacy. However, most recent research and applications of federated learning assume that all clients have fully labeled data, which is impractical in real-world settings. In this work, we focus on a new scenario for cross-silo federated learning, where data samples of each client are partially labeled. We borrow ideas from semi-supervised learning methods where a large amount of unlabeled data is utilized to improve the model's accuracy despite limited access to labeled examples. We propose a new framework dubbed SemiFed that unifies two dominant approaches for semi-supervised learning: consistency regularization and pseudo-labeling. SemiFed first applies advanced data augmentation techniques to enforce consistency…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
