FakeFormer: Efficient Vulnerability-Driven Transformers for   Generalisable Deepfake Detection

Dat Nguyen; Marcella Astrid; Enjie Ghorbel; Djamila Aouada

arXiv:2410.21964·cs.CV·November 26, 2024

FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection

Dat Nguyen, Marcella Astrid, Enjie Ghorbel, Djamila Aouada

PDF

Open Access

TL;DR

FakeFormer enhances Vision Transformers for deepfake detection by focusing on artifact-vulnerable regions, leading to better generalization and efficiency without large datasets.

Contribution

The paper introduces FakeFormer, a novel ViT-based framework that explicitly learns to detect subtle forgery artifacts, improving deepfake detection performance.

Findings

01

Outperforms state-of-the-art on multiple datasets

02

Requires less training data for effective detection

03

Achieves better generalization and computational efficiency

Abstract

Recently, Vision Transformers (ViTs) have achieved unprecedented effectiveness in the general domain of image classification. Nonetheless, these models remain underexplored in the field of deepfake detection, given their lower performance as compared to Convolution Neural Networks (CNNs) in that specific context. In this paper, we start by investigating why plain ViT architectures exhibit a suboptimal performance when dealing with the detection of facial forgeries. Our analysis reveals that, as compared to CNNs, ViT struggles to model localized forgery artifacts that typically characterize deepfakes. Based on this observation, we propose a deepfake detection framework called FakeFormer, which extends ViTs to enforce the extraction of subtle inconsistency-prone information. For that purpose, an explicit attention learning guided by artifact-vulnerable patches and tailored to ViTs is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Digital Media Forensic Detection

MethodsSoftmax · Attention Is All You Need · Convolution