Blind VQA on 360{\deg} Video via Progressively Learning from Pixels,   Frames and Video

Li Yang; Mai Xu; Shengxi Li; Yichen Guo; Zulin Wang

arXiv:2111.09503·cs.CV·November 19, 2021

Blind VQA on 360{\deg} Video via Progressively Learning from Pixels, Frames and Video

Li Yang, Mai Xu, Shengxi Li, Yichen Guo, Zulin Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces ProVQA, a novel blind quality assessment method for 360-degree videos that mimics human perception by progressively learning from pixels, frames, and video-level cues, significantly improving accuracy.

Contribution

ProVQA uniquely models the progressive human perception process for 360-degree videos by integrating three specialized sub-networks for spatial, motion, and multi-frame quality assessment.

Findings

01

ProVQA outperforms existing BVQA methods on two datasets.

02

The approach effectively models spherical perception and motion cues.

03

Significant improvement in quality prediction accuracy.

Abstract

Blind visual quality assessment (BVQA) on 360{\textdegree} video plays a key role in optimizing immersive multimedia systems. When assessing the quality of 360{\textdegree} video, human tends to perceive its quality degradation from the viewport-based spatial distortion of each spherical frame to motion artifact across adjacent frames, ending with the video-level quality score, i.e., a progressive quality assessment paradigm. However, the existing BVQA approaches for 360{\textdegree} video neglect this paradigm. In this paper, we take into account the progressive paradigm of human perception towards spherical video quality, and thus propose a novel BVQA approach (namely ProVQA) for 360{\textdegree} video via progressively learning from pixels, frames and video. Corresponding to the progressive learning of pixels, frames and video, three sub-nets are designed in our ProVQA approach,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yanglixiaoshen/provqa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Advanced Image Fusion Techniques

MethodsAttentive Walk-Aggregating Graph Neural Network