Are You Robert or RoBERTa? Deceiving Online Authorship Attribution   Models Using Neural Text Generators

Keenan Jones; Jason R. C. Nurse; Shujun Li

arXiv:2203.09813·cs.CL·March 21, 2022

Are You Robert or RoBERTa? Deceiving Online Authorship Attribution Models Using Neural Text Generators

Keenan Jones, Jason R. C. Nurse, Shujun Li

PDF

Open Access

TL;DR

This paper investigates whether advanced AI language models like GPT-2 can generate texts that deceive online authorship attribution systems, revealing potential vulnerabilities in author identification methods used in digital forensics.

Contribution

It demonstrates that current neural text generators can effectively mimic individual author styles to deceive authorship attribution models.

Findings

01

AI-generated texts can successfully mimic authorial style

02

Current AA models can be deceived by GPT-2 generated texts

03

Implications for spam detection and forensic investigations

Abstract

Recently, there has been a rise in the development of powerful pre-trained natural language models, including GPT-2, Grover, and XLM. These models have shown state-of-the-art capabilities towards a variety of different NLP tasks, including question answering, content summarisation, and text generation. Alongside this, there have been many studies focused on online authorship attribution (AA). That is, the use of models to identify the authors of online texts. Given the power of natural language models in generating convincing texts, this paper examines the degree to which these language models can generate texts capable of deceiving online AA models. Experimenting with both blog and Twitter data, we utilise GPT-2 language models to generate texts using the existing posts of online users. We then examine whether these AI-based text generators are capable of mimicking authorial style to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Hate Speech and Cyberbullying Detection · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Adam · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Softmax · Attention Dropout · Residual Connection