Blind VQA on 360{\deg} Video via Progressively Learning from Pixels, Frames and Video
Li Yang, Mai Xu, Shengxi Li, Yichen Guo, Zulin Wang

TL;DR
This paper introduces ProVQA, a novel blind quality assessment method for 360-degree videos that mimics human perception by progressively learning from pixels, frames, and video-level cues, significantly improving accuracy.
Contribution
ProVQA uniquely models the progressive human perception process for 360-degree videos by integrating three specialized sub-networks for spatial, motion, and multi-frame quality assessment.
Findings
ProVQA outperforms existing BVQA methods on two datasets.
The approach effectively models spherical perception and motion cues.
Significant improvement in quality prediction accuracy.
Abstract
Blind visual quality assessment (BVQA) on 360{\textdegree} video plays a key role in optimizing immersive multimedia systems. When assessing the quality of 360{\textdegree} video, human tends to perceive its quality degradation from the viewport-based spatial distortion of each spherical frame to motion artifact across adjacent frames, ending with the video-level quality score, i.e., a progressive quality assessment paradigm. However, the existing BVQA approaches for 360{\textdegree} video neglect this paradigm. In this paper, we take into account the progressive paradigm of human perception towards spherical video quality, and thus propose a novel BVQA approach (namely ProVQA) for 360{\textdegree} video via progressively learning from pixels, frames and video. Corresponding to the progressive learning of pixels, frames and video, three sub-nets are designed in our ProVQA approach,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Advanced Image Fusion Techniques
MethodsAttentive Walk-Aggregating Graph Neural Network
