DePrompt: Desensitization and Evaluation of Personal Identifiable   Information in Large Language Model Prompts

Xiongtao Sun; Gan Liu; Zhipeng He; Hui Li; Xiaoguang Li

arXiv:2408.08930·cs.CR·August 20, 2024

DePrompt: Desensitization and Evaluation of Personal Identifiable Information in Large Language Model Prompts

Xiongtao Sun, Gan Liu, Zhipeng He, Hui Li, Xiaoguang Li

PDF

Open Access

TL;DR

DePrompt is a framework that enhances privacy protection in large language model prompts by desensitizing PII while maintaining prompt utility, using fine-tuning and adversarial methods evaluated through new metrics.

Contribution

This paper introduces DePrompt, a novel framework combining fine-tuning and adversarial desensitization techniques for privacy-preserving prompts in LLMs, with utility evaluation metrics.

Findings

01

DePrompt effectively reduces PII leakage in prompts.

02

The framework maintains high semantic content and usability.

03

Experimental results outperform existing methods in privacy and utility balance.

Abstract

Prompt serves as a crucial link in interacting with large language models (LLMs), widely impacting the accuracy and interpretability of model outputs. However, acquiring accurate and high-quality responses necessitates precise prompts, which inevitably pose significant risks of personal identifiable information (PII) leakage. Therefore, this paper proposes DePrompt, a desensitization protection and effectiveness evaluation framework for prompt, enabling users to safely and transparently utilize LLMs. Specifically, by leveraging large model fine-tuning techniques as the underlying privacy protection method, we integrate contextual attributes to define privacy types, achieving high-precision PII entity identification. Additionally, through the analysis of key features in prompt desensitization scenarios, we devise adversarial generative desensitization methods that retain important…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques