Differentially-Private Sublinear-Time Clustering
Jeremiah Blocki, Elena Grigorescu, Tamalika Mukherjee

TL;DR
This paper introduces sublinear-time differentially-private clustering algorithms by combining existing sublinear clustering methods with privacy-preserving techniques, enabling efficient and private unsupervised learning.
Contribution
It presents the first sublinear-time differentially-private clustering algorithms for k-means and k-median, leveraging subsampling and analyzing privacy benefits.
Findings
Achieves sublinear-time private clustering algorithms.
Demonstrates privacy benefits of subsampling for group privacy.
Extends prior work by combining sublinear algorithms with differential privacy.
Abstract
Clustering is an essential primitive in unsupervised machine learning. We bring forth the problem of sublinear-time differentially-private clustering as a natural and well-motivated direction of research. We combine the -means and -median sublinear-time results of Mishra et al. (SODA, 2001) and of Czumaj and Sohler (Rand. Struct. and Algorithms, 2007) with recent results on private clustering of Balcan et al. (ICML 2017), Gupta et al. (SODA, 2010) and Ghazi et al. (NeurIPS, 2020) to obtain sublinear-time private -means and -median algorithms via subsampling. We also investigate the privacy benefits of subsampling for group privacy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · HIV, Drug Use, Sexual Risk · Statistical Methods and Bayesian Inference
