Evaluating Privacy Leakage in Split Learning

Xinchi Qiu; Ilias Leontiadis; Luca Melis; Alex Sablayrolles; Pierre; Stock

arXiv:2305.12997·cs.LG·January 23, 2024·1 cites

Evaluating Privacy Leakage in Split Learning

Xinchi Qiu, Ilias Leontiadis, Luca Melis, Alex Sablayrolles, Pierre, Stock

PDF

Open Access

TL;DR

This paper investigates privacy risks in split learning, revealing that exchanged gradients can leak private information, but differential privacy can mitigate these risks with minimal impact on model performance.

Contribution

The study provides a comprehensive analysis of privacy leakage in split learning and evaluates mitigation strategies like differential privacy.

Findings

01

Gradients enable near-perfect private feature reconstruction.

02

Differential privacy effectively reduces privacy risks.

03

Mitigation strategies cause minimal training degradation.

Abstract

Privacy-Preserving machine learning (PPML) can help us train and deploy models that utilize private information. In particular, on-device machine learning allows us to avoid sharing raw data with a third-party server during inference. On-device models are typically less accurate when compared to their server counterparts due to the fact that (1) they typically only rely on a small set of on-device features and (2) they need to be small enough to run efficiently on end-user devices. Split Learning (SL) is a promising approach that can overcome these limitations. In SL, a large machine learning model is divided into two parts, with the bigger part residing on the server side and a smaller part executing on-device, aiming to incorporate the private features. However, end-to-end training of such models requires exchanging gradients at the cut layer, which might encode private features or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting