DAVID-XR1: Detecting AI-Generated Videos with Explainable Reasoning

Yifeng Gao; Yifan Ding; Hongyu Su; Juncheng Li; Yunhan Zhao; Lin Luo; Zixing Chen; Li Wang; Xin Wang; Yixu Wang; Xingjun Ma; Yu-Gang Jiang

arXiv:2506.14827·cs.CV·June 19, 2025

DAVID-XR1: Detecting AI-Generated Videos with Explainable Reasoning

Yifeng Gao, Yifan Ding, Hongyu Su, Juncheng Li, Yunhan Zhao, Lin Luo, Zixing Chen, Li Wang, Xin Wang, Yixu Wang, Xingjun Ma, Yu-Gang Jiang

PDF

Open Access

TL;DR

This paper introduces DAVID-XR1, a novel video-language model that provides explainable, fine-grained detection of AI-generated videos using detailed annotations and reasoning, enhancing trustworthiness and transparency.

Contribution

It presents DAVID-XR1, the first model to offer interpretable, defect-level explanations for AI-generated video detection, supported by a new dataset with rich annotations.

Findings

01

Model achieves strong generalization across different AI video generators.

02

Fine-grained explanations improve trust and interpretability.

03

Chain-of-thought distillation enhances detection accuracy.

Abstract

As AI-generated video becomes increasingly pervasive across media platforms, the ability to reliably distinguish synthetic content from authentic footage has become both urgent and essential. Existing approaches have primarily treated this challenge as a binary classification task, offering limited insight into where or why a model identifies a video as AI-generated. However, the core challenge extends beyond simply detecting subtle artifacts; it requires providing fine-grained, persuasive evidence that can convince auditors and end-users alike. To address this critical gap, we introduce DAVID-X, the first dataset to pair AI-generated videos with detailed defect-level, temporal-spatial annotations and written rationales. Leveraging these rich annotations, we present DAVID-XR1, a video-language model designed to deliver an interpretable chain of visual reasoning-including defect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications