A Novel Local Focusing Mechanism for Deepfake Detection Generalization
Mingliang Li, Lin Yuanbo Wu, Changhong Liu, Hanxi Li

TL;DR
This paper introduces a Local Focus Mechanism (LFM) that enhances deepfake detection by attending to discriminative local features, improving cross-domain generalization and robustness over existing methods.
Contribution
The paper proposes a novel LFM with a Salience Network and Top-K Pooling, along with regularization techniques, to improve deepfake detection accuracy and generalization across domains.
Findings
LFM improves accuracy by 3.7% over NPR.
LFM achieves 1789 FPS on NVIDIA A6000.
LFM sets a new benchmark for cross-domain deepfake detection.
Abstract
The rapid advancement of deepfake generation techniques has intensified the need for robust and generalizable detection methods. Existing approaches based on reconstruction learning typically leverage deep convolutional networks to extract differential features. However, these methods show poor generalization across object categories (e.g., from faces to cars) and generation domains (e.g., from GANs to Stable Diffusion), due to intrinsic limitations of deep CNNs. First, models trained on a specific category tend to overfit to semantic feature distributions, making them less transferable to other categories, especially as network depth increases. Second, Global Average Pooling (GAP) compresses critical local forgery cues into a single vector, thus discarding discriminative patterns vital for real-fake classification. To address these issues, we propose a novel Local Focus Mechanism (LFM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
