Uncovering Gradient Inversion Risks in Practical Language Model Training

Xinguo Feng; Zhongkui Ma; Zihan Wang; Eu Joe Chegne; Mengyao Ma; Alsharif Abuadbba; Guangdong Bai

arXiv:2507.21198·cs.LG·July 30, 2025

Uncovering Gradient Inversion Risks in Practical Language Model Training

Xinguo Feng, Zhongkui Ma, Zihan Wang, Eu Joe Chegne, Mengyao Ma, Alsharif Abuadbba, Guangdong Bai

PDF

TL;DR

This paper introduces Grab, a novel gradient inversion attack tailored for language models in federated learning, demonstrating high data recovery rates and revealing significant privacy risks.

Contribution

It presents a domain-specific attack method that overcomes practical training challenges, significantly improving data recovery in language model privacy attacks.

Findings

01

Recover up to 92.9% of private data

02

Outperform existing methods by up to 28.9% in benchmark settings

03

Achieve 48.5% higher recovery in practical scenarios

Abstract

The gradient inversion attack has been demonstrated as a significant privacy threat to federated learning (FL), particularly in continuous domains such as vision models. In contrast, it is often considered less effective or highly dependent on impractical training settings when applied to language models, due to the challenges posed by the discrete nature of tokens in text data. As a result, its potential privacy threats remain largely underestimated, despite FL being an emerging training method for language models. In this work, we propose a domain-specific gradient inversion attack named Grab (gradient inversion with hybrid optimization). Grab features two alternating optimization processes to address the challenges caused by practical training settings, including a simultaneous optimization on dropout masks between layers for improved token recovery and a discrete optimization for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.