PD-ML-Lite: Private Distributed Machine Learning from Lighweight Cryptography
Maksim Tsikhanovich, Malik Magdon-Ismail, Muhammad Ishaq and, Vassilis Zikas

TL;DR
This paper introduces PD-ML-Lite, a privacy-preserving distributed machine learning methodology using lightweight cryptography, achieving accuracy comparable to non-private methods with optimal communication and measurable privacy guarantees.
Contribution
The paper presents a novel approach applying lightweight cryptographic protocols to distributed ML algorithms, maintaining accuracy and efficiency without heavy MPC frameworks.
Findings
Protocols are communication optimal.
Achieve same accuracy as non-private algorithms.
Applicable to various ML tasks like topic modeling and recommender systems.
Abstract
Privacy is a major issue in learning from distributed data. Recently the cryptographic literature has provided several tools for this task. However, these tools either reduce the quality/accuracy of the learning algorithm---e.g., by adding noise---or they incur a high performance penalty and/or involve trusting external authorities. We propose a methodology for {\sl private distributed machine learning from light-weight cryptography} (in short, PD-ML-Lite). We apply our methodology to two major ML algorithms, namely non-negative matrix factorization (NMF) and singular value decomposition (SVD). Our resulting protocols are communication optimal, achieve the same accuracy as their non-private counterparts, and satisfy a notion of privacy---which we define---that is both intuitive and measurable. Our approach is to use lightweight cryptographic protocols (secure sum and normalized secure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCryptography and Data Security · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
