FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling
Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu, Sun, Qiong Yan, Weisi Lin

TL;DR
FAST-VQA introduces an efficient end-to-end video quality assessment method that uses fragment sampling and a specialized network to significantly reduce computational costs while improving accuracy.
Contribution
The paper proposes Grid Mini-patch Sampling and a Fragment Attention Network to enhance VQA efficiency and accuracy, enabling effective high-resolution video assessment with minimal computation.
Findings
Improves state-of-the-art accuracy by around 10%.
Reduces 99.5% FLOPs on 1080P videos.
Maintains good performance across various resolutions.
Abstract
Current deep video quality assessment (VQA) methods are usually with high computational costs when evaluating high-resolution videos. This cost hinders them from learning better video-quality-related representations via end-to-end training. Existing approaches typically consider naive sampling to reduce the computational cost, such as resizing and cropping. However, they obviously corrupt quality-related information in videos and are thus not optimal for learning good representations for VQA. Therefore, there is an eager need to design a new quality-retained sampling scheme for VQA. In this paper, we propose Grid Mini-patch Sampling (GMS), which allows consideration of local quality by sampling patches at their raw resolution and covers global quality with contextual relations via mini-patches sampled in uniform grids. These mini-patches are spliced and aligned temporally, named as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Image and Signal Denoising Methods
MethodsLinear Layer · Softmax · Multi-Head Attention · Residual Connection · Attention Is All You Need · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing · Adam · Dropout
