MMNet: Multi-Collaboration and Multi-Supervision Network for Sequential Deepfake Detection
Ruiyang Xia, Decheng Liu, Jie Li, Lin Yuan, Nannan Wang, Xinbo Gao

TL;DR
This paper introduces MMNet, a novel deepfake detection framework that handles various manipulations and sequences without needing manipulation models, and proposes CSM, a new metric for evaluating sequential detection accuracy.
Contribution
The paper presents MMNet, a multi-collaboration and multi-supervision network that improves deepfake detection and recovery across diverse manipulations without prior model knowledge.
Findings
MMNet achieves state-of-the-art detection accuracy.
MMNet effectively handles various spatial and sequential manipulations.
The proposed CSM metric provides a comprehensive evaluation of sequential detection performance.
Abstract
Advanced manipulation techniques have provided criminals with opportunities to make social panic or gain illicit profits through the generation of deceptive media, such as forged face images. In response, various deepfake detection methods have been proposed to assess image authenticity. Sequential deepfake detection, which is an extension of deepfake detection, aims to identify forged facial regions with the correct sequence for recovery. Nonetheless, due to the different combinations of spatial and sequential manipulations, forged face images exhibit substantial discrepancies that severely impact detection performance. Additionally, the recovery of forged images requires knowledge of the manipulation model to implement inverse transformations, which is difficult to ascertain as relevant techniques are often concealed by attackers. To address these issues, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis
