Delving into Sequential Patches for Deepfake Detection
Jiazhi Guan, Hang Zhou, Zhibin Hong, Errui Ding, Jingdong Wang,, Chengbin Quan, Youjian Zhao

TL;DR
This paper introduces LTTD, a transformer-based framework that leverages local and temporal cues to improve deepfake detection, achieving state-of-the-art results and robustness against post-processing.
Contribution
The paper proposes a novel local-to-global learning protocol with a Local Sequence Transformer that models local temporal consistency for enhanced deepfake detection.
Findings
Achieves state-of-the-art performance on popular datasets.
Effectively detects local forgery cues.
Robust against post-processing techniques.
Abstract
Recent advances in face forgery techniques produce nearly visually untraceable deepfake videos, which could be leveraged with malicious intentions. As a result, researchers have been devoted to deepfake detection. Previous studies have identified the importance of local low-level cues and temporal information in pursuit to generalize well across deepfake methods, however, they still suffer from robustness problem against post-processings. In this work, we propose the Local- & Temporal-aware Transformer-based Deepfake Detection (LTTD) framework, which adopts a local-to-global learning protocol with a particular focus on the valuable temporal information within local sequences. Specifically, we propose a Local Sequence Transformer (LST), which models the temporal consistency on sequences of restricted spatial regions, where low-level information is hierarchically enhanced with shallow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Face recognition and analysis
MethodsAttention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Residual Connection · Dense Connections · Position-Wise Feed-Forward Layer · Adam · Dropout · Label Smoothing
