X2-DFD: A framework for eXplainable and eXtendable Deepfake Detection

Yize Chen; Zhiyuan Yan; Guangliang Cheng; Kangran Zhao; Siwei Lyu; Baoyuan Wu

arXiv:2410.06126·cs.CV·May 30, 2025

X2-DFD: A framework for eXplainable and eXtendable Deepfake Detection

Yize Chen, Zhiyuan Yan, Guangliang Cheng, Kangran Zhao, Siwei Lyu, Baoyuan Wu

PDF

Open Access

TL;DR

X2-DFD is a novel framework that enhances deepfake detection by combining explainability and extendability through multimodal large-language models, involving feature assessment, dataset construction, and fine-tuning.

Contribution

The paper introduces a comprehensive, plug-and-play framework that systematically evaluates, enhances, and extends deepfake detection capabilities of large-language models.

Findings

01

Improved detection accuracy demonstrated in experiments.

02

Enhanced explainability validated through human studies.

03

Framework's extensibility allows integration of new detectors.

Abstract

This paper proposes X2-DFD, an eXplainable and eXtendable framework based on multimodal large-language models (MLLMs) for deepfake detection, consisting of three key stages. The first stage, Model Feature Assessment, systematically evaluates the detectability of forgery-related features for the MLLM, generating a prioritized ranking of features based on their intrinsic importance to the model. The second stage, Explainable Dataset Construction, consists of two key modules: Strong Feature Strengthening, which is designed to enhance the model's existing detection and explanation capabilities by reinforcing its well-learned features, and Weak Feature Supplementing, which addresses gaps by integrating specific feature detectors (e.g., low-level artifact analyzers) to compensate for the MLLM's limitations. The third stage, Fine-tuning and Inference, involves fine-tuning the MLLM on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Digital Media Forensic Detection