FedClust: Tackling Data Heterogeneity in Federated Learning through Weight-Driven Client Clustering
Md Sirajul Islam, Simin Javaherian, Fei Xu, Xu Yuan, Li Chen,, Nian-Feng Tzeng

TL;DR
FedClust introduces a weight-driven client clustering method for federated learning that effectively handles data heterogeneity, achieving higher accuracy and faster convergence with fewer communication rounds compared to existing approaches.
Contribution
FedClust proposes a one-shot clustering approach based on local model weights, eliminating the need for multiple communication rounds and predefined cluster numbers in federated learning.
Findings
Achieves up to 45% higher accuracy.
Reduces communication cost by up to 2.7 times.
Converges faster than state-of-the-art methods.
Abstract
Federated learning (FL) is an emerging distributed machine learning paradigm that enables collaborative training of machine learning models over decentralized devices without exposing their local data. One of the major challenges in FL is the presence of uneven data distributions across client devices, violating the well-known assumption of independent-and-identically-distributed (IID) training samples in conventional machine learning. To address the performance degradation issue incurred by such data heterogeneity, clustered federated learning (CFL) shows its promise by grouping clients into separate learning clusters based on the similarity of their local data distributions. However, state-of-the-art CFL approaches require a large number of communication rounds to learn the distribution similarities during training until the formation of clusters is stabilized. Moreover, some of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Privacy, Security, and Data Protection
