Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients
Nan Lu, Zhao Wang, Xiaoxiao Li, Gang Niu, Qi Dou, Masashi Sugiyama

TL;DR
This paper introduces FedUL, a federated learning method that enables training classifiers from unlabeled data across clients by transforming unlabeled data into surrogate labels, thus broadening FL applicability without requiring data labeling.
Contribution
The paper presents FedUL, a novel unsupervised federated learning framework that converts unlabeled data into surrogate labels, compatible with existing FL methods, and guarantees model recovery.
Findings
FedUL effectively trains classifiers from unlabeled data in federated settings.
Experimental results show FedUL outperforms baseline methods on benchmark datasets.
FedUL is compatible with various supervised FL algorithms and guarantees model accuracy.
Abstract
Supervised federated learning (FL) enables multiple clients to share the trained model without sharing their labeled data. However, potential clients might even be reluctant to label their own data, which could limit the applicability of FL in practice. In this paper, we show the possibility of unsupervised FL whose model is still a classifier for predicting class labels, if the class-prior probabilities are shifted while the class-conditional distributions are shared among the unlabeled data owned by the clients. We propose federation of unsupervised learning (FedUL), where the unlabeled data are transformed into surrogate labeled data for each of the clients, a modified model is trained by supervised FL, and the wanted model is recovered from the modified model. FedUL is a very general solution to unsupervised FL: it is compatible with many supervised FL methods, and the recovery of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Imbalanced Data Classification Techniques
