Enhancing Data Privacy in Large Language Models through Private   Association Editing

Davide Venditti; Elena Sofia Ruzzetti; Giancarlo A. Xompero; Cristina; Giannone; Andrea Favalli; Raniero Romagnoli; Fabio Massimo Zanzotto

arXiv:2406.18221·cs.CL·October 17, 2024

Enhancing Data Privacy in Large Language Models through Private Association Editing

Davide Venditti, Elena Sofia Ruzzetti, Giancarlo A. Xompero, Cristina, Giannone, Andrea Favalli, Raniero Romagnoli, Fabio Massimo Zanzotto

PDF

Open Access

TL;DR

This paper introduces Private Association Editing (PAE), a novel method to prevent private data leakage in large language models by removing PII without retraining, enhancing data privacy in text-generation applications.

Contribution

The paper presents PAE, a new approach that effectively removes PII from LLM outputs without requiring model retraining, addressing privacy concerns in large language models.

Findings

01

PAE effectively removes PII from LLM outputs.

02

PAE outperforms baseline methods in privacy preservation.

03

PAE maintains model utility while enhancing privacy.

Abstract

Large language models (LLMs) require a significant redesign in solutions to preserve privacy in data-intensive applications due to their text-generation capabilities. Indeed, LLMs tend to memorize and emit private information when maliciously prompted. In this paper, we introduce Private Association Editing (PAE) as a novel defense approach for private data leakage. PAE is designed to effectively remove Personally Identifiable Information (PII) without retraining the model. Experimental results demonstrate the effectiveness of PAE with respect to alternative baseline methods. We believe PAE will serve as a critical tool in the ongoing effort to protect data privacy in LLMs, encouraging the development of safer models for real-world applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security