User-Level Label Leakage from Gradients in Federated Learning

Aidmar Wainakh; Fabrizio Ventola; Till M\"u{\ss}ig; Jens Keim; and Carlos Garcia Cordero; Ephraim Zimmer; Tim Grube; Kristian; Kersting; Max M\"uhlh\"auser

arXiv:2105.09369·cs.CR·January 4, 2022

User-Level Label Leakage from Gradients in Federated Learning

Aidmar Wainakh, Fabrizio Ventola, Till M\"u{\ss}ig, Jens Keim, and Carlos Garcia Cordero, Ephraim Zimmer, Tim Grube, Kristian, Kersting, Max M\"uhlh\"auser

PDF

Open Access 2 Repos

TL;DR

This paper reveals a new privacy risk in federated learning where shared gradients can leak user data labels, demonstrating an effective attack and discussing potential defenses.

Contribution

The paper introduces Label Leakage from Gradients (LLG), a novel attack that can accurately extract data labels from gradients in federated learning.

Findings

01

LLG effectively leaks labels with high accuracy.

02

The attack works across different batch sizes and classes.

03

Gradient compression can mitigate label leakage.

Abstract

Federated learning enables multiple users to build a joint model by sharing their model updates (gradients), while their raw data remains local on their devices. In contrast to the common belief that this provides privacy benefits, we here add to the very recent results on privacy risks when sharing gradients. Specifically, we investigate Label Leakage from Gradients (LLG), a novel attack to extract the labels of the users' training data from their shared gradients. The attack exploits the direction and magnitude of gradients to determine the presence or absence of any label. LLG is simple yet effective, capable of leaking potential sensitive information represented by labels, and scales well to arbitrary batch sizes and multiple classes. We mathematically and empirically demonstrate the validity of the attack under different settings. Moreover, empirical results show that LLG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning