Seeing the Forest through the Trees: Data Leakage from Partial   Transformer Gradients

Weijun Li; Qiongkai Xu; Mark Dras

arXiv:2406.00999·cs.LG·October 7, 2024

Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients

Weijun Li, Qiongkai Xu, Mark Dras

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper demonstrates that even partial gradients from specific layers or components of Transformer models can leak training data, highlighting significant privacy risks and limited protection from differential privacy techniques.

Contribution

The study reveals that small portions of model gradients, including single layers or linear components, can cause data leakage, expanding understanding of privacy vulnerabilities in distributed training.

Findings

01

Gradients from a single Transformer layer can leak training data.

02

A linear component with only 0.54% of parameters can cause data leakage.

03

Differential privacy provides limited protection against this vulnerability.

Abstract

Recent studies have shown that distributed machine learning is vulnerable to gradient inversion attacks, where private training data can be reconstructed by analyzing the gradients of the models shared in training. Previous attacks established that such reconstructions are possible using gradients from all parameters in the entire models. However, we hypothesize that most of the involved modules, or even their sub-modules, are at risk of training data leakage, and we validate such vulnerabilities in various intermediate layers of language models. Our extensive experiments reveal that gradients from a single Transformer layer, or even a single linear component with 0.54% parameters, are susceptible to training data leakage. Additionally, we show that applying differential privacy on gradients during training offers limited protection against the novel vulnerability of data disclosure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

weijun-l/partial-gradients-leakage
pytorchOfficial

Videos

Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients· underline

Taxonomy

TopicsFault Detection and Control Systems · Power Transformer Diagnostics and Insulation

MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention