Pioneering Perceptual Video Fluency Assessment: A Novel Task with Benchmark Dataset and Baseline

Qizhi Xie; Kun Yuan; Yunpeng Qu; Ming Sun; Chao Zhou; Jihong Zhu

arXiv:2603.26055·cs.CV·March 30, 2026

Pioneering Perceptual Video Fluency Assessment: A Novel Task with Benchmark Dataset and Baseline

Qizhi Xie, Kun Yuan, Yunpeng Qu, Ming Sun, Chao Zhou, Jihong Zhu

PDF

TL;DR

This paper introduces Video Fluency Assessment as a new perceptual task, creating a dataset, benchmarking methods, and proposing a baseline model to improve temporal video quality evaluation.

Contribution

It pioneers VFA as a standalone task, constructs a dataset, benchmarks 23 methods, and proposes FluNet with T-PSA for better temporal fluency assessment.

Findings

01

FluVid dataset contains 4,606 videos with fluency scores.

02

Benchmarking reveals insights for VFA-specific model design.

03

FluNet achieves state-of-the-art performance in VFA.

Abstract

Accurately estimating humans' subjective feedback on video fluency, e.g., motion consistency and frame continuity, is crucial for various applications like streaming and gaming. Yet, it has long been overlooked, as prior arts have focused on solving it in the video quality assessment (VQA) task, merely as a sub-dimension of overall quality. In this work, we conduct pilot experiments and reveal that current VQA predictions largely underrepresent fluency, thereby limiting their applicability. To this end, we pioneer Video Fluency Assessment (VFA) as a standalone perceptual task focused on the temporal dimension. To advance VFA research, 1) we construct a fluency-oriented dataset, FluVid, comprising 4,606 in-the-wild videos with balanced fluency distribution, featuring the first-ever scoring criteria and human study for VFA. 2) We develop a large-scale benchmark of 23 methods, the most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.