Model Attribution of Face-swap Deepfake Videos
Shan Jia, Xin Li, Siwei Lyu

TL;DR
This paper addresses the challenge of identifying the specific Deepfake generation model used, introducing a new dataset and a novel attention-based method that achieves over 70% accuracy in model attribution.
Contribution
It introduces the first dataset for Deepfake model attribution and proposes a novel spatial-temporal attention method for accurate source identification.
Findings
Most existing detection methods fail in model attribution.
The proposed method achieves over 70% accuracy on high-quality Deepfake dataset.
The dataset includes 6,450 videos from five different Autoencoder-based models.
Abstract
AI-created face-swap videos, commonly known as Deepfakes, have attracted wide attention as powerful impersonation attacks. Existing research on Deepfakes mostly focuses on binary detection to distinguish between real and fake videos. However, it is also important to determine the specific generation model for a fake video, which can help attribute it to the source for forensic investigation. In this paper, we fill this gap by studying the model attribution problem of Deepfake videos. We first introduce a new dataset with DeepFakes from Different Models (DFDM) based on several Autoencoder models. Specifically, five generation models with variations in encoder, decoder, intermediate layer, input resolution, and compression ratio have been used to generate a total of 6,450 Deepfake videos based on the same input. Then we take Deepfakes model attribution as a multiclass classification task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
