InferDPT: Privacy-Preserving Inference for Closed-box Large Language Model
Meng Tong, Kejiang Chen, Jie Zhang, Yuang Qi, Weiming Zhang, Nenghai Yu, Tianwei Zhang, Zhikun Zhang

TL;DR
InferDPT introduces a practical framework for privacy-preserving inference in black-box large language models using differential privacy, perturbation, and extraction modules, achieving high privacy protection with minimal utility loss.
Contribution
This paper presents InferDPT, the first practical differential privacy framework for black-box LLM inference, incorporating RANTEXT to enhance privacy against embedding revision attacks.
Findings
InferDPT maintains text quality comparable to GPT-4 under privacy constraints.
RANTEXT outperforms SANTEXT+ and CUSTEXT+ in privacy-utility trade-offs.
Achieves over 90% privacy protection against embedding revision attacks at epsilon=6.0.
Abstract
Large language models (LLMs), like ChatGPT, have greatly simplified text generation tasks. However, they have also raised concerns about privacy risks such as data leakage and unauthorized data collection. Existing solutions for privacy-preserving inference face practical challenges related to computation time and communication costs. In this paper, we propose InferDPT, the first practical framework for the privacy-preserving Inference of black-box LLMs, implementing Differential Privacy in Text generation. InferDPT comprises two key modules: the "perturbation module" utilizes the exponential mechanism to generate a perturbed prompt, facilitating privacy-preserving inference with black-box LLMs, and the "extraction module", inspired by knowledge distillation and retrieval-augmented generation, extracts coherent and consistent text from the perturbed generation result, ensuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
