$A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural   Machine Translation

Rakshith Shetty; Bernt Schiele; Mario Fritz

arXiv:1711.01921·cs.CR·February 20, 2018·5 cites

$A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

Rakshith Shetty, Bernt Schiele, Mario Fritz

PDF

Open Access

TL;DR

This paper introduces $A^4NT$, a neural translation-based adversarial method that obfuscates author attributes in text to enhance privacy, using unpaired data and semantic preservation techniques.

Contribution

The paper presents a novel adversarial training approach combining neural translation and GANs for author attribute anonymization without requiring paired datasets.

Findings

01

$A^4NT$ effectively fools author attribute classifiers.

02

It preserves input semantics while anonymizing authorship.

03

The method improves author anonymity across multiple datasets.

Abstract

Text-based analysis methods allow to reveal privacy relevant author attributes such as gender, age and identify of the text's author. Such methods can compromise the privacy of an anonymous author even when the author tries to remove privacy sensitive content. In this paper, we propose an automatic method, called Adversarial Author Attribute Anonymity Neural Translation ( $A^{4} N T$ ), to combat such text-based adversaries. We combine sequence-to-sequence language models used in machine translation and generative adversarial networks to obfuscate author attributes. Unlike machine translation techniques which need paired data, our method can be trained on unpaired corpora of text containing different authors. Importantly, we propose and evaluate techniques to impose constraints on our $A^{4} N T$ to preserve the semantics of the input text. $A^{4} N T$ learns to make minimal changes to the input text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Topic Modeling · Hate Speech and Cyberbullying Detection