Learning Generalized Spatial-Temporal Deep Feature Representation for   No-Reference Video Quality Assessment

Baoliang Chen; Lingyu Zhu; Guo Li; Hongfei Fan; and Shiqi Wang

arXiv:2012.13936·eess.IV·June 24, 2021·1 cites

Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

Baoliang Chen, Lingyu Zhu, Guo Li, Hongfei Fan, and Shiqi Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a no-reference video quality assessment method that learns generalized spatial-temporal features, effectively handling variations in content, resolution, and frame rate to improve cross-dataset prediction accuracy.

Contribution

The paper proposes a novel spatial-temporal feature learning approach with Gaussian distribution constraints and a pyramid temporal aggregation module for enhanced generalization.

Findings

01

Outperforms state-of-the-art methods in cross-dataset evaluations

02

Achieves comparable performance on intra-dataset tests

03

Demonstrates high generalization capability across diverse video qualities

Abstract

In this work, we propose a no-reference video quality assessment method, aiming to achieve high-generalization capability in cross-content, -resolution and -frame rate quality prediction. In particular, we evaluate the quality of a video by learning effective feature representations in spatial-temporal domain. In the spatial domain, to tackle the resolution and content variations, we impose the Gaussian distribution constraints on the quality features. The unified distribution can significantly reduce the domain gap between different video samples, resulting in a more generalized quality feature representation. Along the temporal dimension, inspired by the mechanism of visual perception, we propose a pyramid temporal aggregation module by involving the short-term and long-term memory to aggregate the frame-level quality. Experiments show that our method outperforms the state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Baoliang93/GSTVQA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Advanced Image Processing Techniques · Image Enhancement Techniques