Q-Save: Towards Scoring and Attribution for Generated Video Evaluation

Xiele Wu; Zicheng Zhang; Mingtao Chen; Yixian Liu; Yiming Liu; Shushi Wang; Zhichao Hu; Yuhong Liu; Guangtao Zhai; Xiaohong Liu

arXiv:2511.18825·cs.CV·March 2, 2026

Q-Save: Towards Scoring and Attribution for Generated Video Evaluation

Xiele Wu, Zicheng Zhang, Mingtao Chen, Yixian Liu, Yiming Liu, Shushi Wang, Zhichao Hu, Yuhong Liu, Guangtao Zhai, Xiaohong Liu

PDF

Open Access

TL;DR

Q-Save introduces a comprehensive dataset and a unified model for evaluating AI-generated videos, addressing previous limitations by jointly assessing quality dimensions and providing interpretability.

Contribution

It presents a new dataset with detailed annotations and a novel evaluation model that jointly scores and attributes quality in generated videos.

Findings

01

Q-Save outperforms existing methods in quality prediction accuracy.

02

The model provides interpretable attributions for video quality.

03

The dataset enables systematic evaluation of multiple quality dimensions.

Abstract

Evaluating AI-generated video (AIGV) quality hinges on three crucial dimensions: visual quality, dynamic quality, and text-video alignment. While numerous evaluation datasets and algorithms have been proposed, existing approaches are constrained by two limitations: the absence of systematic definitions for evaluation dimensions, and the isolated treatment of the three dimensions in separate models. Therefore, we introduce Q-Save, a holistic benchmark dataset and unified evaluation model for AIGV quality assessment. The Q-Save dataset contains nearly 10,000 video samples, each annotated with Mean Opinion Scores (MOS) and fine-grained attribution explanations across the three core dimensions. Leveraging this attribution-annotated dataset, we train the proposed Q-Save model, which adopts the SlowFast framework to balance accuracy and efficiency, and employs a three-stage training strategy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment · Visual Attention and Saliency Detection