Preventing Author Profiling through Zero-Shot Multilingual   Back-Translation

David Ifeoluwa Adelani; Miaoran Zhang; Xiaoyu Shen; Ali Davody; Thomas; Kleinbauer; and Dietrich Klakow

arXiv:2109.09133·cs.CL·September 21, 2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation

David Ifeoluwa Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas, Kleinbauer, and Dietrich Klakow

PDF

Open Access 1 Repo

TL;DR

This paper introduces a zero-shot multilingual back-translation method to reduce author profiling risks in texts, maintaining high utility for downstream tasks without requiring training data.

Contribution

It presents a novel zero-shot approach using off-the-shelf translation models for style transfer to enhance privacy in text data.

Findings

01

Reduces gender and race prediction accuracy by up to 22%

02

Retains 95% of original utility in downstream tasks

03

Outperforms five style transfer models in evaluations

Abstract

Documents as short as a single sentence may inadvertently reveal sensitive information about their authors, including e.g. their gender or ethnicity. Style transfer is an effective way of transforming texts in order to remove any information that enables author profiling. However, for a number of current state-of-the-art approaches the improved privacy is accompanied by an undesirable drop in the down-stream utility of the transformed data. In this paper, we propose a simple, zero-shot way to effectively lower the risk of author profiling through multilingual back-translation using off-the-shelf translation models. We compare our models with five representative text style transfer models on three datasets across different domains. Results from both an automatic and a human evaluation show that our approach achieves the best overall performance while requiring no training data. We are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uds-lsv/author-profiling-prevention-bt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Topic Modeling · Hate Speech and Cyberbullying Detection