Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers
Yue Zhang, Chuanlong Qiu, Xinfa Liao, Yiqun Zhang

TL;DR
This paper introduces Fed-$k^*$-HC, a federated clustering framework that automatically determines the optimal number of clusters by hierarchical merging of client-generated micro-subclusters, addressing unknown and imbalanced cluster scenarios.
Contribution
It proposes a novel federated clustering method that automatically identifies the optimal cluster count using hierarchical merging and micro-subclusters, improving robustness and accuracy.
Findings
Accurately determines the number of clusters in diverse datasets.
Handles varying cluster sizes and shapes effectively.
Demonstrates superior performance over existing methods.
Abstract
Federated Clustering (FC) is an emerging and promising solution in exploring data distribution patterns from distributed and privacy-protected data in an unsupervised manner. Existing FC methods implicitly rely on the assumption that clients are with a known number of uniformly sized clusters. However, the true number of clusters is typically unknown, and cluster sizes are naturally imbalanced in real scenarios. Furthermore, the privacy-preserving transmission constraints in federated learning inevitably reduce usable information, making the development of robust and accurate FC extremely challenging. Accordingly, we propose a novel FC framework named Fed--HC, which can automatically determine an optimal number of clusters based on the data distribution explored through hierarchical clustering. To obtain the global data distribution for determination, we let each client…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Clustering Algorithms Research · Stochastic Gradient Optimization Techniques
