TL;DR
This paper introduces TweepFake, a dataset of real deepfake tweets and evaluates multiple detection methods to address the challenge of identifying machine-generated social media messages.
Contribution
It provides the first dataset of real deepfake tweets and benchmarks various detection techniques, advancing research in social media deepfake detection.
Findings
Detection remains challenging with current methods.
The dataset enables future research on deepfake social media messages.
Baseline detection results highlight the need for improved techniques.
Abstract
The recent advances in language modeling significantly improved the generative capabilities of deep neural models: in 2019 OpenAI released GPT-2, a pre-trained language model that can autonomously generate coherent, non-trivial and human-like text samples. Since then, ever more powerful text generative models have been developed. Adversaries can exploit these tremendous generative capabilities to enhance social bots that will have the ability to write plausible deepfake messages, hoping to contaminate public debate. To prevent this, it is crucial to develop deepfake social media messages detection systems. However, to the best of our knowledge no one has ever addressed the detection of machine-generated texts on social networks like Twitter or Facebook. With the aim of helping the research in this detection field, we collected the first dataset of \real deepfake tweets, TweepFake. It is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Dense Connections · Residual Connection · Byte Pair Encoding · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Attention Is All You Need · Discriminative Fine-Tuning
