Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
David Ifeoluwa Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas, Kleinbauer, and Dietrich Klakow

TL;DR
This paper introduces a zero-shot multilingual back-translation method to reduce author profiling risks in texts, maintaining high utility for downstream tasks without requiring training data.
Contribution
It presents a novel zero-shot approach using off-the-shelf translation models for style transfer to enhance privacy in text data.
Findings
Reduces gender and race prediction accuracy by up to 22%
Retains 95% of original utility in downstream tasks
Outperforms five style transfer models in evaluations
Abstract
Documents as short as a single sentence may inadvertently reveal sensitive information about their authors, including e.g. their gender or ethnicity. Style transfer is an effective way of transforming texts in order to remove any information that enables author profiling. However, for a number of current state-of-the-art approaches the improved privacy is accompanied by an undesirable drop in the down-stream utility of the transformed data. In this paper, we propose a simple, zero-shot way to effectively lower the risk of author profiling through multilingual back-translation using off-the-shelf translation models. We compare our models with five representative text style transfer models on three datasets across different domains. Results from both an automatic and a human evaluation show that our approach achieves the best overall performance while requiring no training data. We are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Topic Modeling · Hate Speech and Cyberbullying Detection
