Federated Clustering: An Unsupervised Cluster-Wise Training for Decentralized Data Distributions
Mirko Nardi, Lorenzo Valerio, Andrea Passarella

TL;DR
FedCRef introduces a federated clustering method that enables decentralized clients to collaboratively identify multiple underlying data distributions without labels, handling heterogeneous data and without prior knowledge of cluster counts.
Contribution
This work presents FedCRef, a novel unsupervised federated learning approach that generalizes to multi-cluster scenarios and improves clustering accuracy across decentralized, heterogeneous datasets.
Findings
Achieves up to 95% local accuracy on public datasets.
Effectively uncovers true global data distributions.
Robust to noisy data and scalable for resource-constrained devices.
Abstract
Federated Learning (FL) enables decentralized machine learning while preserving data privacy, making it ideal for sensitive applications where data cannot be shared. While FL has been widely studied in supervised contexts, its application to unsupervised learning remains underdeveloped. This work introduces FedCRef, a novel unsupervised federated learning method designed to uncover all underlying data distributions across decentralized clients without requiring labels. This task, known as Federated Clustering, presents challenges due to heterogeneous, non-uniform data distributions and the lack of centralized coordination. Unlike previous methods that assume a one-cluster-per-client setup or require prior knowledge of the number of clusters, FedCRef generalizes to multi-cluster-per-client scenarios. Clients iteratively refine their data partitions while discovering all distinct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Data Mining Algorithms and Applications · Advanced Clustering Algorithms Research
MethodsSparse Evolutionary Training · ALIGN
