Neighbourhood Representative Sampling for Efficient End-to-end Video   Quality Assessment

Haoning Wu; Chaofeng Chen; Liang Liao; Jingwen Hou; Wenxiu Sun; Qiong; Yan; Jinwei Gu; Weisi Lin

arXiv:2210.05357·cs.CV·October 12, 2022·1 cites

Neighbourhood Representative Sampling for Efficient End-to-end Video Quality Assessment

Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong, Yan, Jinwei Gu, Weisi Lin

PDF

Open Access 4 Repos

TL;DR

This paper introduces a novel sampling scheme and a specialized network architecture for efficient and accurate video quality assessment, significantly reducing computational costs while improving performance.

Contribution

It proposes a spatial-temporal grid mini-cube sampling method and the Fragment Attention Network for end-to-end VQA, achieving high accuracy with minimal computational resources.

Findings

01

Achieves superior VQA performance on benchmarks.

02

Requires only 1/1612 of the FLOPs of state-of-the-art methods.

03

Provides open-source code, models, and demos.

Abstract

The increased resolution of real-world videos presents a dilemma between efficiency and accuracy for deep Video Quality Assessment (VQA). On the one hand, keeping the original resolution will lead to unacceptable computational costs. On the other hand, existing practices, such as resizing and cropping, will change the quality of original videos due to the loss of details and contents, and are therefore harmful to quality assessment. With the obtained insight from the study of spatial-temporal redundancy in the human visual system and visual coding theory, we observe that quality information around a neighbourhood is typically similar, motivating us to investigate an effective quality-sensitive neighbourhood representatives scheme for VQA. In this work, we propose a unified scheme, spatial-temporal grid mini-cube sampling (St-GMS) to get a novel type of sample, named fragments.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Advanced Image Processing Techniques · Advanced Image Fusion Techniques