FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment   Sampling

Haoning Wu; Chaofeng Chen; Jingwen Hou; Liang Liao; Annan Wang; Wenxiu; Sun; Qiong Yan; Weisi Lin

arXiv:2207.02595·cs.CV·July 7, 2022·5 cites

FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu, Sun, Qiong Yan, Weisi Lin

PDF

Open Access 4 Repos

TL;DR

FAST-VQA introduces an efficient end-to-end video quality assessment method that uses fragment sampling and a specialized network to significantly reduce computational costs while improving accuracy.

Contribution

The paper proposes Grid Mini-patch Sampling and a Fragment Attention Network to enhance VQA efficiency and accuracy, enabling effective high-resolution video assessment with minimal computation.

Findings

01

Improves state-of-the-art accuracy by around 10%.

02

Reduces 99.5% FLOPs on 1080P videos.

03

Maintains good performance across various resolutions.

Abstract

Current deep video quality assessment (VQA) methods are usually with high computational costs when evaluating high-resolution videos. This cost hinders them from learning better video-quality-related representations via end-to-end training. Existing approaches typically consider naive sampling to reduce the computational cost, such as resizing and cropping. However, they obviously corrupt quality-related information in videos and are thus not optimal for learning good representations for VQA. Therefore, there is an eager need to design a new quality-retained sampling scheme for VQA. In this paper, we propose Grid Mini-patch Sampling (GMS), which allows consideration of local quality by sampling patches at their raw resolution and covers global quality with contextual relations via mini-patches sampled in uniform grids. These mini-patches are spliced and aligned temporally, named as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Image and Signal Denoising Methods

MethodsLinear Layer · Softmax · Multi-Head Attention · Residual Connection · Attention Is All You Need · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Adam · Dropout