Differentially private $k$-means clustering via exponential mechanism   and max cover

Anamay Chaturvedi; Huy Nguyen; Eric Xu

arXiv:2009.01220·cs.DS·September 3, 2020·1 cites

Differentially private $k$-means clustering via exponential mechanism and max cover

Anamay Chaturvedi, Huy Nguyen, Eric Xu

PDF

Open Access

TL;DR

This paper presents a new differentially private $k$-means clustering algorithm that reduces additive error by leveraging maximum coverage on a grid, showing improved practical performance over prior methods.

Contribution

The authors introduce a novel $( ext{epsilon}_p, ext{delta}_p)$-differentially private algorithm for $k$-means that achieves lower additive error by reducing the problem to maximum coverage on a grid.

Findings

01

Achieves lower additive error compared to previous methods.

02

Maintains constant multiplicative error.

03

Experimental results show improved performance.

Abstract

We introduce a new $(ϵ_{p}, δ_{p})$ -differentially private algorithm for the $k$ -means clustering problem. Given a dataset in Euclidean space, the $k$ -means clustering problem requires one to find $k$ points in that space such that the sum of squares of Euclidean distances between each data point and its closest respective point among the $k$ returned is minimised. Although there exist privacy-preserving methods with good theoretical guarantees to solve this problem [Balcan et al., 2017; Kaplan and Stemmer, 2018], in practice it is seen that it is the additive error which dictates the practical performance of these methods. By reducing the problem to a sequence of instances of maximum coverage on a grid, we are able to derive a new method that achieves lower additive error then previous works. For input datasets with cardinality $n$ and diameter $Δ$ , our algorithm has an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Cryptography and Data Security