Cross-Domain Local Characteristic Enhanced Deepfake Video Detection
Zihan Liu, Hanyi Wang, Shilin Wang

TL;DR
This paper introduces a novel deepfake detection method that leverages cross-domain local features from space, frequency, and time domains, focusing on critical facial regions to improve generalization across datasets.
Contribution
The proposed Cross-Domain Local Forensics (XDLF) framework uniquely combines multi-domain features and facial region analysis for more robust deepfake detection.
Findings
Outperforms state-of-the-art methods on multiple benchmarks
Demonstrates strong cross-dataset generalization
Ablation studies highlight the importance of cross-domain local features
Abstract
As ultra-realistic face forgery techniques emerge, deepfake detection has attracted increasing attention due to security concerns. Many detectors cannot achieve accurate results when detecting unseen manipulations despite excellent performance on known forgeries. In this paper, we are motivated by the observation that the discrepancies between real and fake videos are extremely subtle and localized, and inconsistencies or irregularities can exist in some critical facial regions across various information domains. To this end, we propose a novel pipeline, Cross-Domain Local Forensics (XDLF), for more general deepfake video detection. In the proposed pipeline, a specialized framework is presented to simultaneously exploit local forgery patterns from space, frequency, and time domains, thus learning cross-domain features to detect forgeries. Moreover, the framework leverages four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
