AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences
Jieyu Li, Xin Zhang, and Joey Tianyi Zhou

TL;DR
AEGIS is a large-scale, challenging benchmark dataset designed to evaluate AI-generated video authenticity detection, highlighting current models' limitations and advancing research in robust forgery detection.
Contribution
Introduces AEGIS, a comprehensive benchmark with over 10,000 videos and multimodal annotations, addressing limitations of existing datasets and facilitating development of more reliable detection models.
Findings
Current models show limited detection on challenging AEGIS subsets
AEGIS's realism surpasses existing benchmarks, exposing detection gaps
Dataset supports multimodal and forgery localization research
Abstract
Recent advances in AI-generated content have fueled the rise of highly realistic synthetic videos, posing severe risks to societal trust and digital integrity. Existing benchmarks for video authenticity detection typically suffer from limited realism, insufficient scale, and inadequate complexity, failing to effectively evaluate modern vision-language models against sophisticated forgeries. To address this critical gap, we introduce AEGIS, a novel large-scale benchmark explicitly targeting the detection of hyper-realistic and semantically nuanced AI-generated videos. AEGIS comprises over 10,000 rigorously curated real and synthetic videos generated by diverse, state-of-the-art generative models, including Stable Video Diffusion, CogVideoX-5B, KLing, and Sora, encompassing open-source and proprietary architectures. In particular, AEGIS features specially constructed challenging subsets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
