Highly Efficient No-reference 4K Video Quality Assessment with   Full-Pixel Covering Sampling and Training Strategy

Xiaoheng Tan; Jiabin Zhang; Yuhui Quan; Jing Li; Yajing Wu; Zilin Bian

arXiv:2407.20766·eess.IV·July 31, 2024

Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy

Xiaoheng Tan, Jiabin Zhang, Yuhui Quan, Jing Li, Yajing Wu, Zilin Bian

PDF

TL;DR

This paper introduces a novel no-reference 4K video quality assessment method that efficiently processes full-resolution videos using a new sampling strategy, frequency domain features, and a perceptual scoring scheme, outperforming existing methods.

Contribution

It presents the first NR 4K VQA approach with a full-pixel sampling strategy, frequency domain integration, and a perceptual scoring scheme, enabling high accuracy and efficiency.

Findings

01

Outperforms existing methods on 4K VQA datasets

02

Achieves state-of-the-art results across multiple datasets

03

Operates efficiently on consumer-grade GPUs

Abstract

Deep Video Quality Assessment (VQA) methods have shown impressive high-performance capabilities. Notably, no-reference (NR) VQA methods play a vital role in situations where obtaining reference videos is restricted or not feasible. Nevertheless, as more streaming videos are being created in ultra-high definition (e.g., 4K) to enrich viewers' experiences, the current deep VQA methods face unacceptable computational costs. Furthermore, the resizing, cropping, and local sampling techniques employed in these methods can compromise the details and content of original 4K videos, thereby negatively impacting quality assessment. In this paper, we propose a highly efficient and novel NR 4K VQA technology. Specifically, first, a novel data sampling and training strategy is proposed to tackle the problem of excessive resolution. This strategy allows the VQA Swin Transformer-based model to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.