Distributed and Secure ML with Self-tallying Multi-party Aggregation

Yunhui Long; Tanmay Gangwani; Haris Mughees; Carl Gunter

arXiv:1811.10296·cs.CR·November 27, 2018·1 cites

Distributed and Secure ML with Self-tallying Multi-party Aggregation

Yunhui Long, Tanmay Gangwani, Haris Mughees, Carl Gunter

PDF

Open Access 1 Repo

TL;DR

This paper introduces a distributed, privacy-preserving machine learning framework that enables untrusted users to collaboratively train models securely without private channels or trusted third parties, using homomorphic addition and zero-knowledge proofs.

Contribution

The work presents a novel framework combining homomorphic addition and zero-knowledge proofs for secure, distributed ML that is robust against malicious data contributions and does not require trusted third parties.

Findings

01

Framework supports various ML algorithms like LDA, Naive Bayes, and Decision Trees.

02

Ensures data privacy and integrity in untrusted environments.

03

Eliminates need for private channels and trusted third parties.

Abstract

Privacy preserving multi-party computation has many applications in areas such as medicine and online advertisements. In this work, we propose a framework for distributed, secure machine learning among untrusted individuals. The framework consists of two parts: a two-step training protocol based on homomorphic addition and a zero knowledge proof for data validity. By combining these two techniques, our framework provides privacy of per-user data, prevents against a malicious user contributing corrupted data to the shared pool, enables each user to self-compute the results of the algorithm without relying on external trusted third parties, and requires no private channels between groups of users. We show how different ML algorithms such as Latent Dirichlet Allocation, Naive Bayes, Decision Trees etc. fit our framework for distributed, secure computing.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tgangwani/Zorro_SMPC
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Data Quality and Management