Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling
Jiebin Yan, Lei Wu, Yuming Fang, Xuelin Liu, Xue Xia, Weide Liu

TL;DR
This paper explores how minimal spatial and temporal sampling of videos can still maintain acceptable quality assessment performance, aiming to improve efficiency in online video quality assessment models.
Contribution
It introduces a joint spatial-temporal sampling strategy and an initial online VQA model that simplifies processing while preserving accuracy.
Findings
Acceptable VQA performance with heavily sampled videos
Effective joint sampling reduces computational load
Feasibility of simplified online VQA model demonstrated
Abstract
With the rapid development of multimedia processing and deep learning technologies, especially in the field of video understanding, video quality assessment (VQA) has achieved significant progress. Although researchers have moved from designing efficient video quality mapping models to various research directions, in-depth exploration of the effectiveness-efficiency trade-offs of spatio-temporal modeling in VQA models is still less sufficient. Considering the fact that videos have highly redundant information, this paper investigates this problem from the perspective of joint spatial and temporal sampling, aiming to seek the answer to how little information we should keep at least when feeding videos into the VQA models while with acceptable performance sacrifice. To this end, we drastically sample the video's information from both spatial and temporal dimensions, and the heavily…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Video Surveillance and Tracking Methods
