Heterogeneity for the Win: One-Shot Federated Clustering
Don Kurian Dennis, Tian Li, Virginia Smith

TL;DR
This paper introduces $k$-FED, a one-shot federated clustering method that leverages data heterogeneity to improve clustering efficiency and requires minimal communication, with practical applications in personalized federated learning.
Contribution
The paper presents a novel one-shot federated clustering algorithm, $k$-FED, that exploits heterogeneity to relax cluster separation requirements and operates efficiently in practical federated settings.
Findings
Heterogeneity can be advantageous in federated clustering under certain conditions.
$k$-FED requires only one communication round and can operate asynchronously.
Experimental results demonstrate the practical utility of $k$-FED in personalized FL and device sampling.
Abstract
In this work, we explore the unique challenges -- and opportunities -- of unsupervised federated learning (FL). We develop and analyze a one-shot federated clustering scheme, -FED, based on the widely-used Lloyd's method for -means clustering. In contrast to many supervised problems, we show that the issue of statistical heterogeneity in federated networks can in fact benefit our analysis. We analyse -FED under a center separation assumption and compare it to the best known requirements of its centralized counterpart. Our analysis shows that in heterogeneous regimes where the number of clusters per device is smaller than the total number of clusters over the network , , we can use heterogeneity to our advantage -- significantly weakening the cluster separation requirements for -FED. From a practical viewpoint, -FED also has many desirable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Internet Traffic Analysis and Secure E-voting · Mobile Crowdsensing and Crowdsourcing
