DP-BART for Privatized Text Rewriting under Local Differential Privacy

Timour Igamberdiev; Ivan Habernal

arXiv:2302.07636·cs.CR·August 14, 2025·1 cites

DP-BART for Privatized Text Rewriting under Local Differential Privacy

Timour Igamberdiev, Ivan Habernal

PDF

Open Access 1 Repo

TL;DR

The paper introduces DP-BART, a novel system for privatized text rewriting under local differential privacy, which reduces noise and improves performance over existing methods through innovative techniques.

Contribution

DP-BART employs a new clipping method, iterative pruning, and internal representation training to enhance privacy guarantees and performance in privatized text rewriting.

Findings

01

Outperforms existing LDP text rewriting systems

02

Reduces noise required for differential privacy guarantees

03

Effective across multiple textual datasets and tasks

Abstract

Privatized text rewriting with local differential privacy (LDP) is a recent approach that enables sharing of sensitive textual documents while formally guaranteeing privacy protection to individuals. However, existing systems face several issues, such as formal mathematical flaws, unrealistic privacy guarantees, privatization of only individual words, as well as a lack of transparency and reproducibility. In this paper, we propose a new system 'DP-BART' that largely outperforms existing LDP systems. Our approach uses a novel clipping method, iterative pruning, and further training of internal representations which drastically reduces the amount of noise required for DP guarantees. We run experiments on five textual datasets of varying sizes, rewriting them at different privacy guarantees and evaluating the rewritten texts on downstream text classification tasks. Finally, we thoroughly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trusthlt/dp-bart-private-rewriting
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data