Federated One-Shot Ensemble Clustering
Rui Duan, Xin Xiong, Jueyi Liu, Katherine P. Liao, Tianxi Cai

TL;DR
The paper introduces FONT, a federated clustering algorithm that enables multi-site data analysis with privacy and communication constraints, demonstrating superior performance and consistency in real-world applications.
Contribution
FONT is a novel federated clustering method that requires only one communication round and combines local models into a data-adaptive ensemble, ensuring privacy and robustness.
Findings
FONT outperforms existing benchmark methods in simulations.
It improves cluster consistency across different health systems.
Theoretical analysis supports the effectiveness of data-adaptive weights.
Abstract
Cluster analysis across multiple institutions poses significant challenges due to data-sharing restrictions. To overcome these limitations, we introduce the Federated One-shot Ensemble Clustering (FONT) algorithm, a novel solution tailored for multi-site analyses under such constraints. FONT requires only a single round of communication between sites and ensures privacy by exchanging only fitted model parameters and class labels. The algorithm combines locally fitted clustering models into a data-adaptive ensemble, making it broadly applicable to various clustering techniques and robust to differences in cluster proportions across sites. Our theoretical analysis validates the effectiveness of the data-adaptive weights learned by FONT, and simulation studies demonstrate its superior performance compared to existing benchmark methods. We applied FONT to identify subgroups of patients with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Brain Tumor Detection and Classification · Privacy-Preserving Technologies in Data
MethodsEnsemble Clustering
