TL;DR
RobustSora is a benchmark dataset designed to evaluate AI-generated video detectors' robustness against watermark manipulation, revealing detectors' reliance on watermark cues and emphasizing the need for watermark-aware evaluation.
Contribution
This work introduces RobustSora, a comprehensive benchmark with evaluation protocols to analyze how watermark presence affects AI-generated video detection.
Findings
Watermark manipulation causes accuracy changes of -9.4 to +1.6 percentage points across models.
Watermark-aware training improves detection robustness by 3-4 percentage points.
Detectors' reliance on watermark cues varies by generator, not architecture.
Abstract
The proliferation of AI-generated video models poses new challenges to information integrity and digital trust. A key confound, however, remains unaddressed: commercial generators embed visible overlay watermarks for provenance tracking, yet no existing benchmark controls for this variable, leaving open whether detectors learn genuine generation artefacts or merely associate watermark patterns with AI-generated labels. We present RobustSora, a benchmark of 6,500 manually verified videos in four categories: Authentic-Clean (A-C), Generated-Watermarked (G-W), Generated-DeWatermarked (G-DeW), and Authentic-Spoofed (A-S), sourced from Vript, DVF, and UltraVideo (authentic) and from Sora, Sora 2, Pika, Open-Sora 2, and KLing (generated). Two evaluation tasks isolate watermark effects: Task-I (Watermark Erasure Robustness) tests detection on watermark-removed AI videos; Task-II (Watermark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
