Fair Federated Data Clustering through Personalization: Bridging the Gap between Diverse Data Distributions
Shivam Gupta, Tarushi, Tsering Wangzes, Shweta Jain

TL;DR
This paper introduces p-FClus, a personalized federated clustering algorithm that balances clustering cost and fairness across clients, working efficiently in a single communication round and handling diverse, unlabelled data distributions.
Contribution
The paper presents the first personalized federated clustering method that reduces cost variance and is applicable to any finite norm, improving efficiency and fairness.
Findings
p-FClus achieves lower clustering cost across clients.
The method is data-independent and works with various norms.
It requires only a single communication round.
Abstract
The rapid growth of data from edge devices has catalyzed the performance of machine learning algorithms. However, the data generated resides at client devices thus there are majorly two challenge faced by traditional machine learning paradigms - centralization of data for training and secondly for most the generated data the class labels are missing and there is very poor incentives to clients to manually label their data owing to high cost and lack of expertise. To overcome these issues, there have been initial attempts to handle unlabelled data in a privacy preserving distributed manner using unsupervised federated data clustering. The goal is partition the data available on clients into partitions (called clusters) without actual exchange of data. Most of the existing algorithms are highly dependent on data distribution patterns across clients or are computationally expensive.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection
