Privacy Preserving K-Means Clustering: A Secure Multi-Party Computation   Approach

Daniel Hurtado Ram\'irez; J. M. Au\~n\'on

arXiv:2009.10453·cs.LG·September 23, 2020·1 cites

Privacy Preserving K-Means Clustering: A Secure Multi-Party Computation Approach

Daniel Hurtado Ram\'irez, J. M. Au\~n\'on

PDF

Open Access

TL;DR

This paper presents a cryptographic protocol for privacy-preserving K-means clustering that enables data analysis across distributed sources without compromising individual data privacy.

Contribution

It introduces a secure multi-party computation approach for K-means clustering applicable to horizontally and vertically distributed data sources.

Findings

01

Effective privacy preservation in distributed K-means clustering

02

Applicable to both horizontal and vertical data distributions

03

Maintains clustering accuracy while ensuring data confidentiality

Abstract

Knowledge discovery is one of the main goals of Artificial Intelligence. This Knowledge is usually stored in databases spread in different environments, being a tedious (or impossible) task to access and extract data from them. To this difficulty we must add that these datasources may contain private data, therefore the information can never leave the source. Privacy Preserving Machine Learning (PPML) helps to overcome this difficulty, employing cryptographic techniques, allowing knowledge discovery while ensuring data privacy. K-means is one of the data mining techniques used in order to discover knowledge, grouping data points in clusters that contain similar features. This paper focuses in Privacy Preserving Machine Learning applied to K-means using recent protocols from the field of criptography. The algorithm is applied to different scenarios where data may be distributed either…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Blockchain Technology Applications and Security