VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video   Editing Quality Assessment

Shangkun Sun; Xiaoyu Liang; Songlin Fan; Wenxu Gao; Wei Gao

arXiv:2408.11481·cs.CV·December 19, 2024

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

Shangkun Sun, Xiaoyu Liang, Songlin Fan, Wenxu Gao, Wei Gao

PDF

Open Access 2 Repos 1 Video

TL;DR

VE-Bench introduces a comprehensive benchmark suite and a new human-aligned metric for evaluating the quality of text-driven video editing, addressing the gap in effective quantitative assessment tools.

Contribution

It provides the first quality assessment dataset for video editing and proposes a novel metric aligned with human perception, improving evaluation accuracy.

Findings

01

VE-Bench DB includes diverse videos and human scores.

02

VE-Bench QA outperforms traditional metrics in alignment with human preferences.

03

The proposed metric effectively assesses text-video relevance and quality.

Abstract

Text-driven video editing has recently experienced rapid development. Despite this, evaluating edited videos remains a considerable challenge. Current metrics tend to fail to align with human perceptions, and effective quantitative metrics for video editing are still notably absent. To address this, we introduce VE-Bench, a benchmark suite tailored to the assessment of text-driven video editing. This suite includes VE-Bench DB, a video quality assessment (VQA) database for video editing. VE-Bench DB encompasses a diverse set of source videos featuring various motions and subjects, along with multiple distinct editing prompts, editing results from 8 different models, and the corresponding Mean Opinion Scores (MOS) from 24 human annotators. Based on VE-Bench DB, we further propose VE-Bench QA, a quantitative human-aligned measurement for the text-driven video editing task. In addition to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment· underline

Taxonomy

TopicsDigital Rights Management and Security · Multimedia Communication and Technology

MethodsSparse Evolutionary Training · ALIGN