EvolveReason: Self-Evolving Reasoning Paradigm for Explainable Deepfake Facial Image Identification
Binjia Zhou, Dawei Luo, Shuai Chen, Feng Xu, Seow, Haoyuan Li, Jiachi Wang, Jiawen Wang, Zunlei Feng, Yijun Bei

TL;DR
EvolveReason is a novel self-evolving reasoning framework for deepfake facial image identification that mimics human reasoning, improves explainability, and enhances detection accuracy through a chain-of-thought dataset, latent-space analysis, and reinforcement learning.
Contribution
It introduces a human-like reasoning paradigm with a chain-of-thought dataset, forgery latent-space analysis, and self-evolution strategies to improve deepfake detection and explanation.
Findings
Outperforms state-of-the-art in detection accuracy
Provides detailed forgery explanations
Demonstrates strong generalization capabilities
Abstract
With the rapid advancement of AIGC technology, developing identification methods to address the security challenges posed by deepfakes has become urgent. Face forgery identification techniques can be categorized into two types: traditional classification methods and explainable VLM approaches. The former provides classification results but lacks explanatory ability, while the latter, although capable of providing coarse-grained explanations, often suffers from hallucinations and insufficient detail. To overcome these limitations, we propose EvolveReason, which mimics the reasoning and observational processes of human auditors when identifying face forgeries. By constructing a chain-of-thought dataset, CoT-Face, tailored for advanced VLMs, our approach guides the model to think in a human-like way, prompting it to output reasoning processes and judgment results. This provides…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning
