Evaluating Privacy Leakage in Split Learning
Xinchi Qiu, Ilias Leontiadis, Luca Melis, Alex Sablayrolles, Pierre, Stock

TL;DR
This paper investigates privacy risks in split learning, revealing that exchanged gradients can leak private information, but differential privacy can mitigate these risks with minimal impact on model performance.
Contribution
The study provides a comprehensive analysis of privacy leakage in split learning and evaluates mitigation strategies like differential privacy.
Findings
Gradients enable near-perfect private feature reconstruction.
Differential privacy effectively reduces privacy risks.
Mitigation strategies cause minimal training degradation.
Abstract
Privacy-Preserving machine learning (PPML) can help us train and deploy models that utilize private information. In particular, on-device machine learning allows us to avoid sharing raw data with a third-party server during inference. On-device models are typically less accurate when compared to their server counterparts due to the fact that (1) they typically only rely on a small set of on-device features and (2) they need to be small enough to run efficiently on end-user devices. Split Learning (SL) is a promising approach that can overcome these limitations. In SL, a large machine learning model is divided into two parts, with the bigger part residing on the server side and a smaller part executing on-device, aiming to incorporate the private features. However, end-to-end training of such models requires exchanging gradients at the cut layer, which might encode private features or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Internet Traffic Analysis and Secure E-voting
