Towards Interactive Deepfake Analysis
Lixiong Qin, Ning Jiang, Yang Zhang, Yuhan Qiu, Dingheng Zeng, Jiani, Hu, Weihong Deng

TL;DR
This paper introduces an interactive deepfake analysis framework using instruction-tuned multi-modal large language models, addressing dataset scarcity and efficiency issues, and providing a new benchmark and system for the community.
Contribution
It presents a novel instruction-tuning approach with a new dataset, benchmark, and an interactive analysis system for deepfake detection and explanation.
Findings
Created DFA-Instruct dataset for instruction following
Developed DFA-Bench for comprehensive evaluation
Built DFA-GPT system with LoRA for interactive analysis
Abstract
Existing deepfake analysis methods are primarily based on discriminative models, which significantly limit their application scenarios. This paper aims to explore interactive deepfake analysis by performing instruction tuning on multi-modal large language models (MLLMs). This will face challenges such as the lack of datasets and benchmarks, and low training efficiency. To address these issues, we introduce (1) a GPT-assisted data construction process resulting in an instruction-following dataset called DFA-Instruct, (2) a benchmark named DFA-Bench, designed to comprehensively evaluate the capabilities of MLLMs in deepfake detection, deepfake classification, and artifact description, and (3) construct an interactive deepfake analysis system called DFA-GPT, as a strong baseline for the community, with the Low-Rank Adaptation (LoRA) module. The dataset and code will be made available at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
