Label Leakage and Protection in Two-party Split Learning

Oscar Li; Jiankai Sun; Xin Yang; Weihao Gao; Hongyi Zhang; and Junyuan Xie; Virginia Smith; Chong Wang

arXiv:2102.08504·cs.LG·May 26, 2022·64 cites

Label Leakage and Protection in Two-party Split Learning

Oscar Li, Jiankai Sun, Xin Yang, Weihao Gao, Hongyi Zhang, and Junyuan Xie, Virginia Smith, Chong Wang

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper investigates label leakage risks in two-party split learning, demonstrating attack methods and proposing the $ exttt{Marvell}$ perturbation technique to enhance privacy without significantly sacrificing utility.

Contribution

The work introduces a threat model for label leakage, presents effective attack strategies, and proposes the $ exttt{Marvell}$ method to mitigate privacy risks in split learning.

Findings

01

Effective label recovery attacks demonstrated

02

$ exttt{Marvell}$ reduces label leakage significantly

03

Improved privacy-utility tradeoffs shown empirically

Abstract

Two-party split learning is a popular technique for learning a model across feature-partitioned data. In this work, we explore whether it is possible for one party to steal the private label information from the other party during split training, and whether there are methods that can protect against such attacks. Specifically, we first formulate a realistic threat model and propose a privacy loss metric to quantify label leakage in split learning. We then show that there exist two simple yet effective methods within the threat model that can allow one party to accurately recover private ground-truth labels owned by the other party. To combat these attacks, we propose several random perturbation techniques, including $Marvell$ , an approach that strategically finds the structure of the noise perturbation by minimizing the amount of label leakage (measured through our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Label Leakage and Protection in Two-party Split Learning· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security