Differentially private $k$-means clustering via exponential mechanism and max cover
Anamay Chaturvedi, Huy Nguyen, Eric Xu

TL;DR
This paper presents a new differentially private $k$-means clustering algorithm that reduces additive error by leveraging maximum coverage on a grid, showing improved practical performance over prior methods.
Contribution
The authors introduce a novel $( ext{epsilon}_p, ext{delta}_p)$-differentially private algorithm for $k$-means that achieves lower additive error by reducing the problem to maximum coverage on a grid.
Findings
Achieves lower additive error compared to previous methods.
Maintains constant multiplicative error.
Experimental results show improved performance.
Abstract
We introduce a new -differentially private algorithm for the -means clustering problem. Given a dataset in Euclidean space, the -means clustering problem requires one to find points in that space such that the sum of squares of Euclidean distances between each data point and its closest respective point among the returned is minimised. Although there exist privacy-preserving methods with good theoretical guarantees to solve this problem [Balcan et al., 2017; Kaplan and Stemmer, 2018], in practice it is seen that it is the additive error which dictates the practical performance of these methods. By reducing the problem to a sequence of instances of maximum coverage on a grid, we are able to derive a new method that achieves lower additive error then previous works. For input datasets with cardinality and diameter , our algorithm has an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Cryptography and Data Security
