A new type of federated clustering: A non-model-sharing approach
Yuji Kawamata, Kaoru Kamijo, Masateru Kihira, Akihiro Toyoda, Tomoru Nakayama, Akira Imakura, Tetsuya Sakurai, Yukihiko Okada

TL;DR
This paper introduces DC-Clustering, a federated clustering method that enables privacy-preserving, efficient, and flexible clustering over complex distributed data without sharing raw data, suitable for sensitive domains.
Contribution
It proposes a novel federated clustering approach supporting complex data splits, sharing only intermediate representations, and achieving high performance with minimal communication.
Findings
Achieves clustering performance comparable to centralized methods.
Supports complex data partitioning scenarios with horizontal and vertical splits.
Requires only a single communication round.
Abstract
In recent years, the growing need to leverage sensitive data across institutions has led to increased attention on federated learning (FL), a decentralized machine learning paradigm that enables model training without sharing raw data. However, existing FL-based clustering methods, known as federated clustering, typically assume simple data partitioning scenarios such as horizontal or vertical splits, and cannot handle more complex distributed structures. This study proposes data collaboration clustering (DC-Clustering), a novel federated clustering method that supports clustering over complex data partitioning scenarios where horizontal and vertical splits coexist. In DC-Clustering, each institution shares only intermediate representations instead of raw data, ensuring privacy preservation while enabling collaborative clustering. The method allows flexible selection between k-means and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Big Data and Digital Economy · Stochastic Gradient Optimization Techniques
MethodsSoftmax · Attention Is All You Need
