DF40: Toward Next-Generation Deepfake Detection

Zhiyuan Yan; Taiping Yao; Shen Chen; Yandan Zhao; Xinghe Fu; Junwei; Zhu; Donghao Luo; Chengjie Wang; Shouhong Ding; Yunsheng Wu; Li Yuan

arXiv:2406.13495·cs.CV·November 1, 2024·2 cites

DF40: Toward Next-Generation Deepfake Detection

Zhiyuan Yan, Taiping Yao, Shen Chen, Yandan Zhao, Xinghe Fu, Junwei, Zhu, Donghao Luo, Chengjie Wang, Shouhong Ding, Yunsheng Wu, Li Yuan

PDF

Open Access 1 Repo 4 Datasets

TL;DR

This paper introduces DF40, a diverse deepfake detection benchmark with 40 techniques, to address dataset limitations and evaluate detection methods comprehensively, aiming to improve real-world deepfake detection generalization.

Contribution

The paper presents DF40, a new diverse deepfake dataset with 40 techniques, and conducts extensive evaluations to identify factors affecting detection performance and generalization.

Findings

01

Dataset diversity impacts detection accuracy.

02

Current models struggle with new deepfake techniques.

03

Evaluation protocols influence perceived model robustness.

Abstract

We propose a new comprehensive benchmark to revolutionize the current deepfake detection field to the next generation. Predominantly, existing works identify top-notch detection algorithms and models by adhering to the common practice: training detectors on one specific dataset (e.g., FF++) and testing them on other prevalent deepfake datasets. This protocol is often regarded as a "golden compass" for navigating SoTA detectors. But can these stand-out "winners" be truly applied to tackle the myriad of realistic and diverse deepfakes lurking in the real world? If not, what underlying factors contribute to this gap? In this work, we found the dataset (both train and test) can be the "primary culprit" due to: (1) forgery diversity: Deepfake techniques are commonly referred to as both face forgery and entire image synthesis. Most existing datasets only contain partial types of them, with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YZY-stack/DF40
pytorchOfficial

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis