FastLloyd: Federated, Accurate, Secure, and Tunable $k$-Means Clustering with Differential Privacy
Abdulrahman Diaa, Thomas Humphries, Florian Kerschbaum

TL;DR
FastLloyd introduces a federated $k$-means clustering method that is faster, more private, and more accurate by combining secure aggregation and differential privacy enhancements, significantly outperforming previous approaches.
Contribution
The paper presents a novel federated $k$-means clustering algorithm that achieves high speed, improved privacy, and better utility by integrating secure computation with differential privacy in a lightweight manner.
Findings
Achieves five orders of magnitude speed-up over prior work.
Maintains and improves utility in differential privacy models.
Provides a secure, privacy-preserving federated clustering solution.
Abstract
We study the problem of privacy-preserving -means clustering in the horizontally federated setting. Existing federated approaches using secure computation suffer from substantial overheads and do not offer output privacy. At the same time, differentially private (DP) -means algorithms either assume a trusted central curator or significantly degrade utility by adding noise in the local DP model. Naively combining the secure and central DP solutions results in a protocol with impractical overhead. Instead, our work provides enhancements to both the DP and secure computation components, resulting in a design that is faster, more private, and more accurate than previous work. By utilizing the computational DP model, we design a lightweight, secure aggregation-based approach that achieves five orders of magnitude speed-up over state-of-the-art related work. Furthermore, we not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Clustering Algorithms Research · Customer churn and segmentation
