# Privacy-preserving Distributed Machine Learning via Local Randomization   and ADMM Perturbation

**Authors:** Xin Wang, Hideaki Ishii, Linkang Du, Peng Cheng, Jiming Chen

arXiv: 1908.01059 · 2020-08-26

## TL;DR

This paper introduces a privacy-preserving distributed machine learning framework using ADMM with local randomization and noise perturbation, enabling heterogeneous privacy guarantees without trusting the server and minimizing privacy loss over iterations.

## Contribution

It proposes a novel ADMM-based DML framework that does not assume trusted servers and offers heterogeneous privacy levels based on data sensitivity and trust degrees.

## Key findings

- The framework effectively balances privacy and model accuracy.
- Experimental results validate the theoretical privacy guarantees.
- The approach reduces privacy loss over multiple ADMM iterations.

## Abstract

With the proliferation of training data, distributed machine learning (DML) is becoming more competent for large-scale learning tasks. However, privacy concerns have to be given priority in DML, since training data may contain sensitive information of users. In this paper, we propose a privacy-preserving ADMM-based DML framework with two novel features: First, we remove the assumption commonly made in the literature that the users trust the server collecting their data. Second, the framework provides heterogeneous privacy for users depending on data's sensitive levels and servers' trust degrees. The challenging issue is to keep the accumulation of privacy losses over ADMM iterations minimal. In the proposed framework, a local randomization approach, which is differentially private, is adopted to provide users with self-controlled privacy guarantee for the most sensitive information. Further, the ADMM algorithm is perturbed through a combined noise-adding method, which simultaneously preserves privacy for users' less sensitive information and strengthens the privacy protection of the most sensitive information. We provide detailed analyses on the performance of the trained model according to its generalization error. Finally, we conduct extensive experiments using real-world datasets to validate the theoretical results and evaluate the classification performance of the proposed framework.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.01059/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1908.01059/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1908.01059/full.md

---
Source: https://tomesphere.com/paper/1908.01059