# CodedPrivateML: A Fast and Privacy-Preserving Framework for Distributed   Machine Learning

**Authors:** Jinhyun So, Basak Guler, A. Salman Avestimehr

arXiv: 1902.00641 · 2021-02-23

## TL;DR

CodedPrivateML is a scalable framework that enables privacy-preserving distributed machine learning by ensuring data and model privacy while maintaining high training efficiency and convergence guarantees.

## Contribution

It introduces CodedPrivateML, a novel approach combining information-theoretic privacy with parallel training, outperforming cryptographic methods in speed.

## Key findings

- Achieves privacy threshold with theoretical guarantees
- Demonstrates convergence for logistic and linear regression
- Provides significant speedup over MPC-based approaches

## Abstract

How to train a machine learning model while keeping the data private and secure? We present CodedPrivateML, a fast and scalable approach to this critical problem. CodedPrivateML keeps both the data and the model information-theoretically private, while allowing efficient parallelization of training across distributed workers. We characterize CodedPrivateML's privacy threshold and prove its convergence for logistic (and linear) regression. Furthermore, via extensive experiments on Amazon EC2, we demonstrate that CodedPrivateML provides significant speedup over cryptographic approaches based on multi-party computing (MPC).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.00641/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1902.00641/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/1902.00641/full.md

---
Source: https://tomesphere.com/paper/1902.00641