AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences

Jieyu Li; Xin Zhang; and Joey Tianyi Zhou

arXiv:2508.10771·cs.CV·August 15, 2025

AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences

Jieyu Li, Xin Zhang, and Joey Tianyi Zhou

PDF

TL;DR

AEGIS is a large-scale, challenging benchmark dataset designed to evaluate AI-generated video authenticity detection, highlighting current models' limitations and advancing research in robust forgery detection.

Contribution

Introduces AEGIS, a comprehensive benchmark with over 10,000 videos and multimodal annotations, addressing limitations of existing datasets and facilitating development of more reliable detection models.

Findings

01

Current models show limited detection on challenging AEGIS subsets

02

AEGIS's realism surpasses existing benchmarks, exposing detection gaps

03

Dataset supports multimodal and forgery localization research

Abstract

Recent advances in AI-generated content have fueled the rise of highly realistic synthetic videos, posing severe risks to societal trust and digital integrity. Existing benchmarks for video authenticity detection typically suffer from limited realism, insufficient scale, and inadequate complexity, failing to effectively evaluate modern vision-language models against sophisticated forgeries. To address this critical gap, we introduce AEGIS, a novel large-scale benchmark explicitly targeting the detection of hyper-realistic and semantically nuanced AI-generated videos. AEGIS comprises over 10,000 rigorously curated real and synthetic videos generated by diverse, state-of-the-art generative models, including Stable Video Diffusion, CogVideoX-5B, KLing, and Sora, encompassing open-source and proprietary architectures. In particular, AEGIS features specially constructed challenging subsets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.