Differentially Private Language Models for Secure Data Sharing
Justus Mattern, Zhijing Jin, Benjamin Weggenmann, Bernhard Schoelkopf,, Mrinmaya Sachan

TL;DR
This paper introduces a method for generating high-quality, private synthetic text data using differentially private training of language models, enabling secure data sharing with strong privacy guarantees and improved downstream task performance.
Contribution
It proposes a novel approach combining global differential privacy with prompt-based language model training to produce accurate, private, and attribute-controlled textual datasets.
Findings
Synthetic data does not leak original data information.
Generated datasets are of high language quality.
Classifiers trained on private synthetic data outperform those trained on real data with DP-SGD.
Abstract
To protect the privacy of individuals whose data is being shared, it is of high importance to develop methods allowing researchers and companies to release textual data while providing formal privacy guarantees to its originators. In the field of NLP, substantial efforts have been directed at building mechanisms following the framework of local differential privacy, thereby anonymizing individual text samples before releasing them. In practice, these approaches are often dissatisfying in terms of the quality of their output language due to the strong noise required for local differential privacy. In this paper, we approach the problem at hand using global differential privacy, particularly by training a generative language model in a differentially private manner and consequently sampling data from it. Using natural language prompts and a new prompt-mismatch loss, we are able to create…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
