Federated K-means Clustering

Swier Garst; Marcel Reinders

arXiv:2310.01195·cs.LG·February 19, 2024

Federated K-means Clustering

Swier Garst, Marcel Reinders

PDF

Open Access

TL;DR

This paper introduces a federated K-means clustering algorithm that enables unsupervised learning across distributed datasets while addressing challenges like varying cluster numbers and convergence issues on less separable data.

Contribution

It presents a novel federated K-means algorithm that handles varying cluster counts and improves convergence on challenging datasets, filling a gap in unsupervised federated learning.

Findings

01

Effective clustering on distributed data without data pooling

02

Addresses varying number of clusters in federated settings

03

Improves convergence on less separable datasets

Abstract

Federated learning is a technique that enables the use of distributed datasets for machine learning purposes without requiring data to be pooled, thereby better preserving privacy and ownership of the data. While supervised FL research has grown substantially over the last years, unsupervised FL methods remain scarce. This work introduces an algorithm which implements K-means clustering in a federated manner, addressing the challenges of varying number of clusters between centers, as well as convergence on less separable datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Vehicular Ad Hoc Networks (VANETs) · Data Mining Algorithms and Applications

Methodsk-Means Clustering