DAGER: Exact Gradient Inversion for Large Language Models

Ivo Petrov; Dimitar I. Dimitrov; Maximilian Baader; Mark Niklas; M\"uller; Martin Vechev

arXiv:2405.15586·cs.LG·November 14, 2024·1 cites

DAGER: Exact Gradient Inversion for Large Language Models

Ivo Petrov, Dimitar I. Dimitrov, Maximilian Baader, Mark Niklas, M\"uller, Martin Vechev

PDF

Open Access 1 Repo 1 Video

TL;DR

DAGER is a novel algorithm that enables exact recovery of full input text batches from large language models' gradients, significantly improving over prior gradient inversion attacks in speed, scalability, and accuracy.

Contribution

DAGER introduces the first method for exact batch recovery in text domain gradient inversion, exploiting low-rank and discrete structures in LLM gradients.

Findings

01

Recovers full text batches up to size 128 with high accuracy

02

Achieves 20x faster speed than prior attacks

03

Scales to larger batches and models with superior reconstruction quality

Abstract

Federated learning works by aggregating locally computed gradients from multiple clients, thus enabling collaborative training without sharing private client data. However, prior work has shown that the data can actually be recovered by the server using so-called gradient inversion attacks. While these attacks perform well when applied on images, they are limited in the text domain and only permit approximate reconstruction of small batches and short input sequences. In this work, we propose DAGER, the first algorithm to recover whole batches of input text exactly. DAGER leverages the low-rank structure of self-attention layer gradients and the discrete nature of token embeddings to efficiently check if a given token sequence is part of the client data. We use this check to exactly recover full batches in the honest-but-curious setting without any prior on the data for both encoder- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

insait-institute/dager-gradient-inversion
pytorchOfficial

Videos

DAGER: Exact Gradient Inversion for Large Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Machine Learning in Healthcare

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings