Information Leakage from Embedding in Large Language Models
Zhipeng Wan, Anda Cheng, Yinggui Wang, Lei Wang

TL;DR
This paper investigates privacy risks in large language models by demonstrating how embeddings can be exploited to reconstruct user inputs, and proposes a new method and defenses to mitigate these privacy threats.
Contribution
It introduces Embed Parrot, a Transformer-based approach for reconstructing inputs from deep layer embeddings, and offers defense mechanisms to protect user privacy.
Findings
Effective reconstruction from shallow layer embeddings
Embed Parrot successfully reconstructs inputs from deep layer embeddings
Proposed defenses reduce reconstruction success
Abstract
The widespread adoption of large language models (LLMs) has raised concerns regarding data privacy. This study aims to investigate the potential for privacy invasion through input reconstruction attacks, in which a malicious model provider could potentially recover user inputs from embeddings. We first propose two base methods to reconstruct original texts from a model's hidden states. We find that these two methods are effective in attacking the embeddings from shallow layers, but their effectiveness decreases when attacking embeddings from deeper layers. To address this issue, we then present Embed Parrot, a Transformer-based method, to reconstruct input from embeddings in deep layers. Our analysis reveals that Embed Parrot effectively reconstructs original inputs from the hidden states of ChatGLM-6B and Llama2-7B, showcasing stable performance across various token lengths and data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsAttention Is All You Need · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Softmax · Balanced Selection · Linear Layer · Parrot
