DePrompt: Desensitization and Evaluation of Personal Identifiable Information in Large Language Model Prompts
Xiongtao Sun, Gan Liu, Zhipeng He, Hui Li, Xiaoguang Li

TL;DR
DePrompt is a framework that enhances privacy protection in large language model prompts by desensitizing PII while maintaining prompt utility, using fine-tuning and adversarial methods evaluated through new metrics.
Contribution
This paper introduces DePrompt, a novel framework combining fine-tuning and adversarial desensitization techniques for privacy-preserving prompts in LLMs, with utility evaluation metrics.
Findings
DePrompt effectively reduces PII leakage in prompts.
The framework maintains high semantic content and usability.
Experimental results outperform existing methods in privacy and utility balance.
Abstract
Prompt serves as a crucial link in interacting with large language models (LLMs), widely impacting the accuracy and interpretability of model outputs. However, acquiring accurate and high-quality responses necessitates precise prompts, which inevitably pose significant risks of personal identifiable information (PII) leakage. Therefore, this paper proposes DePrompt, a desensitization protection and effectiveness evaluation framework for prompt, enabling users to safely and transparently utilize LLMs. Specifically, by leveraging large model fine-tuning techniques as the underlying privacy protection method, we integrate contextual attributes to define privacy types, achieving high-precision PII entity identification. Additionally, through the analysis of key features in prompt desensitization scenarios, we devise adversarial generative desensitization methods that retain important…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
