Beyond Flicker: Detecting Kinematic Inconsistencies for Generalizable Deepfake Video Detection
Alejandro Cobo, Roberto Valle, Jos\'e Miguel Buenaposada, Luis Baumela

TL;DR
This paper introduces a novel method for deepfake video detection that leverages synthetic data with manipulated facial motion to improve generalization to unseen manipulations.
Contribution
It proposes a biomechanical flaw detection approach using synthetic data with broken facial motion dependencies to enhance deepfake detection generalization.
Findings
Achieved state-of-the-art results on multiple benchmarks.
Effective in detecting subtle kinematic inconsistencies.
Improved generalization to unseen deepfake manipulations.
Abstract
Generalizing deepfake detection to unseen manipulations remains a key challenge. A recent approach to tackle this issue is to train a network with pristine face images that have been manipulated with hand-crafted artifacts to extract more generalizable clues. While effective for static images, extending this to the video domain is an open issue. Existing methods model temporal artifacts as frame-to-frame instabilities, overlooking a key vulnerability: the violation of natural motion dependencies between different facial regions. In this paper, we propose a synthetic video generation method that creates training data with subtle kinematic inconsistencies. We train an autoencoder to decompose facial landmark configurations into motion bases. By manipulating these bases, we selectively break the natural correlations in facial movements and introduce these artifacts into pristine videos via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
