Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics

Yuezun Li; Xin Yang; Pu Sun; Honggang Qi; Siwei Lyu

arXiv:1909.12962 · cs.CR · March 17, 2020 · 103 cites

TL;DR

Celeb-DF introduces a large-scale, high-quality DeepFake video dataset to better evaluate detection algorithms, addressing limitations of previous datasets with lower visual quality and realism.

Contribution

The paper presents Celeb-DF, a new challenging dataset with 5,639 high-quality DeepFake videos, improving the resources available for DeepFake detection research.

Findings

01 The videos in Celeb-DF are of higher visual quality than those in previous DeepFake datasets.

02 Existing detection methods perform worse on Celeb-DF than on earlier datasets.

03 Celeb-DF highlights the need for more robust DeepFake detection algorithms.

Abstract

AI-synthesized face-swapping videos, commonly known as DeepFakes, are an emerging problem threatening the trustworthiness of online information. The need to develop and evaluate DeepFake detection algorithms calls for large-scale datasets. However, current DeepFake datasets suffer from low visual quality and do not resemble the DeepFake videos circulated on the Internet. We present a new large-scale, challenging DeepFake video dataset, Celeb-DF, which contains 5,639 high-quality DeepFake videos of celebrities generated using an improved synthesis process. We conduct a comprehensive evaluation of DeepFake detection methods and datasets to demonstrate the escalated level of challenge posed by Celeb-DF.

Code & Models

Videos

Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics · YouTube

Taxonomy

Topics: Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Advanced Steganography and Watermarking Techniques