Exposing Deepfake Face Forgeries with Guided Residuals
Zhiqing Guo, Gaobo Yang, Jiyou Chen, Xingming Sun

TL;DR
This paper introduces GRnet, a novel deepfake detection model that fuses spatial and residual features using a manipulation trace extractor and attention mechanism, significantly improving detection accuracy.
Contribution
The paper proposes a guided residuals network (GRnet) with a manipulation trace extractor and attention fusion mechanism for enhanced deepfake face detection.
Findings
GRnet outperforms state-of-the-art methods on four public datasets.
Achieves 97.72% accuracy on HFF dataset, surpassing existing methods.
Effectively fuses spatial and residual features for robust detection.
Abstract
Residual-domain feature is very useful for Deepfake detection because it suppresses irrelevant content features and preserves key manipulation traces. However, inappropriate residual prediction will bring side effects on detection accuracy. In addition, residual-domain features are easily affected by image operations such as compression. Most existing works exploit either spatial-domain features or residual-domain features, while neglecting that two types of features are mutually correlated. In this paper, we propose a guided residuals network, namely GRnet, which fuses spatial-domain and residual-domain features in a mutually reinforcing way, to expose face images generated by Deepfake. Different from existing prediction based residual extraction methods, we propose a manipulation trace extractor (MTE) to directly remove the content features and preserve manipulation traces. MTE is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Face recognition and analysis · Generative Adversarial Networks and Image Synthesis
