GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for   Robust Deepfake Detection

Jiaxin Chen; Miao Hu; Dengyong Zhang; Jingyang Meng

arXiv:2501.13435·cs.CV·January 24, 2025

GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection

Jiaxin Chen, Miao Hu, Dengyong Zhang, Jingyang Meng

PDF

Open Access

TL;DR

GC-ConsFlow is a dual-stream deepfake detection framework that combines spatial and temporal features, leveraging global context and optical flow residuals to improve robustness against natural facial motions and various compression scenarios.

Contribution

It introduces a novel dual-stream framework with global context aggregation and flow-gradient temporal consistency, enhancing detection of subtle artifacts and motion inconsistencies in deepfake videos.

Findings

01

Outperforms state-of-the-art methods in various compression scenarios

02

Effectively captures subtle spatial artifacts and temporal inconsistencies

03

Demonstrates robustness against natural facial motions

Abstract

The rapid development of Deepfake technology has enabled the generation of highly realistic manipulated videos, posing severe social and ethical challenges. Existing Deepfake detection methods primarily focused on either spatial or temporal inconsistencies, often neglecting the interplay between the two or suffering from interference caused by natural facial motions. To address these challenges, we propose the global context consistency flow (GC-ConsFlow), a novel dual-stream framework that effectively integrates spatial and temporal features for robust Deepfake detection. The global grouped context aggregation module (GGCA), integrated into the global context-aware frame flow stream (GCAF), enhances spatial feature extraction by aggregating grouped global context information, enabling the detection of subtle, spatial artifacts within frames. The flow-gradient temporal consistency…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis