TL;DR
This paper introduces a semi-automated text sanitization tool designed to reduce whistleblower re-identification risks by evaluating and anonymizing textual features, including writing style, while maintaining content utility.
Contribution
It presents a novel classification and mitigation strategy involving the whistleblower in risk assessment and applies risk-adapted anonymization with style-neutral paraphrasing.
Findings
Reduces authorship attribution accuracy from 98.81% to 31.22%.
Preserves up to 73.1% of original semantics.
Effective in real-world whistleblower testimony scenarios.
Abstract
Whistleblowing is essential for ensuring transparency and accountability in both public and private sectors. However, (potential) whistleblowers often fear or face retaliation, even when reporting anonymously. The specific content of their disclosures and their distinct writing style may re-identify them as the source. Legal measures, such as the EU WBD, are limited in their scope and effectiveness. Therefore, computational methods to prevent re-identification are important complementary tools for encouraging whistleblowers to come forward. However, current text sanitization tools follow a one-size-fits-all approach and take an overly limited view of anonymity. They aim to mitigate identification risk by replacing typical high-risk words (such as person names and other NE labels) and combinations thereof with placeholders. Such an approach, however, is inadequate for the whistleblowing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
