Interpretable facial dynamics as behavioral and perceptual traces of deepfakes
Timothy Joseph Murphy, Jennifer Cook, H\'elio Clemente Jos\'e Cuve

TL;DR
This study introduces an interpretable, bio-behavioral approach to deepfake detection based on facial dynamics, revealing that emotional expressions significantly enhance detection accuracy and that model and human judgments often align.
Contribution
The paper proposes a bio-behavioral feature-based method for deepfake detection, linking computational strategies to human perception and highlighting the importance of emotional expressions.
Findings
Higher-order temporal irregularities are more pronounced in deepfakes.
Detection accuracy improves with videos showing emotional expressions.
Model and human judgments align more for emotive videos, diverge for non-emotive ones.
Abstract
Deepfake detection research has largely converged on deep learning approaches that, despite strong benchmark performance, offer limited insight into what distinguishes real from manipulated facial behavior. This study presents an interpretable alternative grounded in bio-behavioral features of facial dynamics and evaluates how computational detection strategies relate to human perceptual judgments. We identify core low-dimensional patterns of facial movement, from which temporal features characterizing spatiotemporal structure were derived. Traditional machine learning classifiers trained on these features achieved modest but significant above-chance deepfake classification, driven by higher-order temporal irregularities that were more pronounced in manipulated than real facial dynamics. Notably, detection was substantially more accurate for videos containing emotive expressions than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
