A Scalable Approach for Privacy-Preserving Collaborative Machine Learning
Jinhyun So, Basak Guler, A. Salman Avestimehr

TL;DR
This paper introduces COPML, a decentralized framework for privacy-preserving collaborative logistic regression training that ensures strong privacy guarantees and significantly improves training speed compared to existing methods.
Contribution
COPML is a novel fully-decentralized framework that encodes data securely and enables scalable, privacy-preserving collaborative training with proven convergence.
Findings
Achieves up to 16x speedup over benchmark protocols
Provides strong statistical privacy against colluding adversaries
Proven convergence of the training process
Abstract
We consider a collaborative learning scenario in which multiple data-owners wish to jointly train a logistic regression model, while keeping their individual datasets private from the other parties. We propose COPML, a fully-decentralized training framework that achieves scalability and privacy-protection simultaneously. The key idea of COPML is to securely encode the individual datasets to distribute the computation load effectively across many parties and to perform the training computations as well as the model updates in a distributed manner on the securely encoded data. We provide the privacy analysis of COPML and prove its convergence. Furthermore, we experimentally demonstrate that COPML can achieve significant speedup in training over the benchmark protocols. Our protocol provides strong statistical privacy guarantees against colluding parties (adversaries) with unbounded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Blockchain Technology Applications and Security
MethodsLogistic Regression
