LatticeGen: A Cooperative Framework which Hides Generated Text in a   Lattice for Privacy-Aware Generation on Cloud

Mengke Zhang; Tianxing He; Tianle Wang; Lu Mi; Fatemehsadat; Mireshghallah; Binyi Chen; Hao Wang; Yulia Tsvetkov

arXiv:2309.17157·cs.CL·April 8, 2024·1 cites

LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud

Mengke Zhang, Tianxing He, Tianle Wang, Lu Mi, Fatemehsadat, Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov

PDF

Open Access 1 Repo

TL;DR

LatticeGen introduces a privacy-preserving framework for cloud-based language model generation, allowing users to hide their generated text within a noised lattice, balancing privacy and generation quality.

Contribution

It presents a novel cooperative framework that enables user-controlled privacy in cloud-based LLM generation by embedding true outputs in a noised lattice with defense mechanisms against server attacks.

Findings

01

Over 50% of semantic content remains hidden under attack

02

Noised lattice degrades generation quality but enhances privacy

03

Effective defense against malicious server attacks

Abstract

In the current user-server interaction paradigm of prompted generation with large language models (LLM) on cloud, the server fully controls the generation process, which leaves zero options for users who want to keep the generated text to themselves. We propose LatticeGen, a cooperative framework in which the server still handles most of the computation while the user controls the sampling operation. The key idea is that the true generated sequence is mixed with noise tokens by the user and hidden in a noised lattice. Considering potential attacks from a hypothetically malicious server and how the user can defend against it, we propose the repeated beam-search attack and the mixing noise scheme. In our experiments we apply LatticeGen to protect both prompt and generation. It is shown that while the noised lattice degrades generation quality, LatticeGen successfully protects the true…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

z0zzz/latticegen
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Digital and Cyber Forensics · Adversarial Robustness in Machine Learning