Detecting AI-Generated Video via Frame Consistency
Long Ma, Zhiyuan Yan, Qinglang Guo, Yong Liao, Haiyang Yu, Pengyuan, Zhou

TL;DR
This paper introduces a new open-source dataset and a novel frame consistency-based detection method, DeCoF, to identify AI-generated videos, emphasizing temporal artifacts for improved generalizability.
Contribution
The paper presents the first open-source dataset for generated video detection and proposes DeCoF, a temporal artifact-focused detection model that outperforms spatial artifact-based methods.
Findings
DeCoF effectively detects videos from unseen generation models.
DeCoF demonstrates strong generalizability across proprietary models.
The dataset covers diverse forgery scenarios and generation techniques.
Abstract
The escalating quality of video generated by advanced video generation methods results in new security challenges, while there have been few relevant research efforts: 1) There is no open-source dataset for generated video detection, 2) No generated video detection method has been proposed so far. To this end, we propose an open-source dataset and a detection method for generated video for the first time. First, we propose a scalable dataset consisting of 964 prompts, covering various forgery targets, scenes, behaviors, and actions, as well as various generation models with different architectures and generation methods, including the most popular commercial models like OpenAI's Sora and Google's Veo. Second, we found via probing experiments that spatial artifact-based detectors lack generalizability. Hence, we propose a simple yet effective \textbf{de}tection model based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Video Analysis and Summarization · Generative Adversarial Networks and Image Synthesis
