Towards More General Video-based Deepfake Detection through Facial   Component Guided Adaptation for Foundation Model

Yue-Hua Han; Tai-Ming Huang; Kai-Lung Hua; Jun-Cheng Chen

arXiv:2404.05583·cs.CV·March 18, 2025·1 cites

Towards More General Video-based Deepfake Detection through Facial Component Guided Adaptation for Foundation Model

Yue-Hua Han, Tai-Ming Huang, Kai-Lung Hua, Jun-Cheng Chen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach for video-based Deepfake detection that leverages foundation models with facial component guidance to improve generalization, efficiency, and robustness against unseen forgeries.

Contribution

It proposes a side-network decoder with facial component guidance that enhances the generalization of foundation models for Deepfake detection in videos.

Findings

01

Demonstrates improved generalization on challenging Deepfake datasets.

02

Shows superiority in training data and parameter efficiency.

03

Achieves increased model robustness against unseen forgeries.

Abstract

Generative models have enabled the creation of highly realistic facial-synthetic images, raising significant concerns due to their potential for misuse. Despite rapid advancements in the field of deepfake detection, developing efficient approaches to leverage foundation models for improved generalizability to unseen forgery samples remains challenging. To address this challenge, we propose a novel side-network-based decoder that extracts spatial and temporal cues using the CLIP image encoder for generalized video-based Deepfake detection. Additionally, we introduce Facial Component Guidance (FCG) to enhance spatial learning generalizability by encouraging the model to focus on key facial regions. By leveraging the generic features of a vision-language foundation model, our approach demonstrates promising generalizability on challenging Deepfake datasets while also exhibiting superiority…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aiiu-lab/dfd-fcg
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis

MethodsContrastive Language-Image Pre-training