Provably Confidential Language Modelling

Xuandong Zhao; Lei Li; Yu-Xiang Wang

arXiv:2205.01863·cs.CL·June 27, 2022

Provably Confidential Language Modelling

Xuandong Zhao, Lei Li, Yu-Xiang Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Confidentially Redacted Training (CRT), a method that provably prevents language models from memorizing sensitive information during training, while maintaining comparable performance to standard models.

Contribution

The paper presents CRT, a novel training approach that integrates privacy guarantees into language models, inspired by differential privacy, and applicable to LSTM and GPT architectures.

Findings

01

CRT prevents unintended memorization of confidential data.

02

Models trained with CRT maintain similar perplexity to standard models.

03

CRT enhances confidentiality with minimal impact on model performance.

Abstract

Large language models are shown to memorize privacy information such as social security numbers in training data. Given the sheer scale of the training corpus, it is challenging to screen and filter these privacy data, either manually or automatically. In this paper, we propose Confidentially Redacted Training (CRT), a method to train language generation models while protecting the confidential segments. We borrow ideas from differential privacy (which solves a related but distinct problem) and show that our method is able to provably prevent unintended memorization by randomizing parts of the training process. Moreover, we show that redaction with an approximately correct screening policy amplifies the confidentiality guarantee. We implement the method for both LSTM and GPT language models. Our experimental results show that the models trained by CRT obtain almost the same perplexity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuandongzhao/crt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data

MethodsAttention Is All You Need · Linear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Softmax · Weight Decay · Adam