Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization   Tool for Mitigating the Risk of Whistleblower Re-Identification

Dimitri Staufer; Frank Pallas; Bettina Berendt

arXiv:2405.01097·cs.CY·May 3, 2024

Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification

Dimitri Staufer, Frank Pallas, Bettina Berendt

PDF

1 Repo

TL;DR

This paper introduces a semi-automated text sanitization tool designed to reduce whistleblower re-identification risks by evaluating and anonymizing textual features, including writing style, while maintaining content utility.

Contribution

It presents a novel classification and mitigation strategy involving the whistleblower in risk assessment and applies risk-adapted anonymization with style-neutral paraphrasing.

Findings

01

Reduces authorship attribution accuracy from 98.81% to 31.22%.

02

Preserves up to 73.1% of original semantics.

03

Effective in real-world whistleblower testimony scenarios.

Abstract

Whistleblowing is essential for ensuring transparency and accountability in both public and private sectors. However, (potential) whistleblowers often fear or face retaliation, even when reporting anonymously. The specific content of their disclosures and their distinct writing style may re-identify them as the source. Legal measures, such as the EU WBD, are limited in their scope and effectiveness. Therefore, computational methods to prevent re-identification are important complementary tools for encouraging whistleblowers to come forward. However, current text sanitization tools follow a one-size-fits-all approach and take an overly limited view of anonymity. They aim to mitigate identification risk by replacing typical high-risk words (such as person names and other NE labels) and combinations thereof with placeholders. Such an approach, however, is inadequate for the whistleblowing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dimitristaufer/Semi-Automated-Text-Sanitization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.