Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs

Yuxin Liu; Fei Wang; Kun Li; Yiqi Nie; Junjie Chen; Zhangling Duan; Zhaohong Jia

arXiv:2603.17761·cs.CV·March 19, 2026

Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs

Yuxin Liu, Fei Wang, Kun Li, Yiqi Nie, Junjie Chen, Zhangling Duan, Zhaohong Jia

PDF

Open Access

TL;DR

This paper introduces SCEP, a training-free framework that enhances deepfake detection by selecting and reasoning over suspicious image patches using LVLMs, avoiding costly fine-tuning.

Contribution

SCEP is a novel evidence-driven approach that leverages LVLMs for cross-domain image deepfake detection without requiring model fine-tuning.

Findings

01

SCEP outperforms strong baselines on multiple benchmarks.

02

It effectively detects diverse and evolving deepfakes.

03

The method reduces reliance on costly model fine-tuning.

Abstract

Image Deepfake Detection (IDD) separates manipulated images from authentic ones by spotting artifacts of synthesis or tampering. Although large vision-language models (LVLMs) offer strong image understanding, adapting them to IDD often demands costly fine-tuning and generalizes poorly to diverse, evolving manipulations. We propose the Semantic Consistent Evidence Pack (SCEP), a training-free LVLM framework that replaces whole-image inference with evidence-driven reasoning. SCEP mines a compact set of suspicious patch tokens that best reveal manipulation cues. It uses the vision encoder's CLS token as a global reference, clusters patch features into coherent groups, and scores patches with a fused metric combining CLS-guided semantic mismatch with frequency-and noise-based anomalies. To cover dispersed traces and avoid redundancy, SCEP samples a few high-confidence patches per cluster…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning