TweepFake: about Detecting Deepfake Tweets

Tiziano Fagni; Fabrizio Falchi; Margherita Gambini; Antonio Martella,; Maurizio Tesconi

arXiv:2008.00036·cs.CL·June 9, 2021

TweepFake: about Detecting Deepfake Tweets

Tiziano Fagni, Fabrizio Falchi, Margherita Gambini, Antonio Martella,, Maurizio Tesconi

PDF

1 Repo

TL;DR

This paper introduces TweepFake, a dataset of real deepfake tweets and evaluates multiple detection methods to address the challenge of identifying machine-generated social media messages.

Contribution

It provides the first dataset of real deepfake tweets and benchmarks various detection techniques, advancing research in social media deepfake detection.

Findings

01

Detection remains challenging with current methods.

02

The dataset enables future research on deepfake social media messages.

03

Baseline detection results highlight the need for improved techniques.

Abstract

The recent advances in language modeling significantly improved the generative capabilities of deep neural models: in 2019 OpenAI released GPT-2, a pre-trained language model that can autonomously generate coherent, non-trivial and human-like text samples. Since then, ever more powerful text generative models have been developed. Adversaries can exploit these tremendous generative capabilities to enhance social bots that will have the ability to write plausible deepfake messages, hoping to contaminate public debate. To prevent this, it is crucial to develop deepfake social media messages detection systems. However, to the best of our knowledge no one has ever addressed the detection of machine-generated texts on social networks like Twitter or Facebook. With the aim of helping the research in this detection field, we collected the first dataset of \real deepfake tweets, TweepFake. It is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tizfa/tweepfake_deepfake_text_detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Dense Connections · Residual Connection · Byte Pair Encoding · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Attention Is All You Need · Discriminative Fine-Tuning