Label Leakage and Protection in Two-party Split Learning
Oscar Li, Jiankai Sun, Xin Yang, Weihao Gao, Hongyi Zhang, and Junyuan Xie, Virginia Smith, Chong Wang

TL;DR
This paper investigates label leakage risks in two-party split learning, demonstrating attack methods and proposing the $ exttt{Marvell}$ perturbation technique to enhance privacy without significantly sacrificing utility.
Contribution
The work introduces a threat model for label leakage, presents effective attack strategies, and proposes the $ exttt{Marvell}$ method to mitigate privacy risks in split learning.
Findings
Effective label recovery attacks demonstrated
$ exttt{Marvell}$ reduces label leakage significantly
Improved privacy-utility tradeoffs shown empirically
Abstract
Two-party split learning is a popular technique for learning a model across feature-partitioned data. In this work, we explore whether it is possible for one party to steal the private label information from the other party during split training, and whether there are methods that can protect against such attacks. Specifically, we first formulate a realistic threat model and propose a privacy loss metric to quantify label leakage in split learning. We then show that there exist two simple yet effective methods within the threat model that can allow one party to accurately recover private ground-truth labels owned by the other party. To combat these attacks, we propose several random perturbation techniques, including , an approach that strategically finds the structure of the noise perturbation by minimizing the amount of label leakage (measured through our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security
