RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection

Zhuo Wang; Xiliang Liu; Ligang Sun

arXiv:2512.10248·cs.CV·May 8, 2026

RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection

Zhuo Wang, Xiliang Liu, Ligang Sun

PDF

1 Video

TL;DR

RobustSora is a benchmark dataset designed to evaluate AI-generated video detectors' robustness against watermark manipulation, revealing detectors' reliance on watermark cues and emphasizing the need for watermark-aware evaluation.

Contribution

This work introduces RobustSora, a comprehensive benchmark with evaluation protocols to analyze how watermark presence affects AI-generated video detection.

Findings

01

Watermark manipulation causes accuracy changes of -9.4 to +1.6 percentage points across models.

02

Watermark-aware training improves detection robustness by 3-4 percentage points.

03

Detectors' reliance on watermark cues varies by generator, not architecture.

Abstract

The proliferation of AI-generated video models poses new challenges to information integrity and digital trust. A key confound, however, remains unaddressed: commercial generators embed visible overlay watermarks for provenance tracking, yet no existing benchmark controls for this variable, leaving open whether detectors learn genuine generation artefacts or merely associate watermark patterns with AI-generated labels. We present RobustSora, a benchmark of 6,500 manually verified videos in four categories: Authentic-Clean (A-C), Generated-Watermarked (G-W), Generated-DeWatermarked (G-DeW), and Authentic-Spoofed (A-S), sourced from Vript, DVF, and UltraVideo (authentic) and from Sora, Sora 2, Pika, Open-Sora 2, and KLing (generated). Two evaluation tasks isolate watermark effects: Task-I (Watermark Erasure Robustness) tests detection on watermark-removed AI videos; Task-II (Watermark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection· underline