Deepfake tweets automatic detection

Adam Frej; Adrian Kaminski; Piotr Marciniak; Szymon Szmajdzinski,; Soveatin Kuntur; Anna Wroblewska

arXiv:2406.16489·cs.CL·June 25, 2024

Deepfake tweets automatic detection

Adam Frej, Adrian Kaminski, Piotr Marciniak, Szymon Szmajdzinski,, Soveatin Kuntur, Anna Wroblewska

PDF

Open Access

TL;DR

This paper develops and evaluates machine learning methods using NLP techniques to detect AI-generated DeepFake tweets, aiming to combat misinformation and improve online information trustworthiness.

Contribution

It introduces a novel approach leveraging NLP and machine learning models trained on the TweepFake dataset for DeepFake tweet detection.

Findings

01

Effective models identified for DeepFake detection

02

Improved accuracy over baseline methods

03

Enhanced understanding of linguistic features in DeepFake texts

Abstract

This study addresses the critical challenge of detecting DeepFake tweets by leveraging advanced natural language processing (NLP) techniques to distinguish between genuine and AI-generated texts. Given the increasing prevalence of misinformation, our research utilizes the TweepFake dataset to train and evaluate various machine learning models. The objective is to identify effective strategies for recognizing DeepFake content, thereby enhancing the integrity of digital communications. By developing reliable methods for detecting AI-generated misinformation, this work contributes to a more trustworthy online information environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection