KonVid-150k: A Dataset for No-Reference Video Quality Assessment of   Videos in-the-Wild

Franz G\"otz-Hahn; Vlad Hosu; Hanhe Lin; Dietmar Saupe

arXiv:1912.07966·cs.MM·March 2, 2021

KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

Franz G\"otz-Hahn, Vlad Hosu, Hanhe Lin, Dietmar Saupe

PDF

TL;DR

This paper introduces KonVid-150k, a large and diverse in-the-wild video quality assessment dataset, and proposes new deep learning methods that significantly improve VQA performance and generalization across datasets.

Contribution

The creation of the KonVid-150k dataset and the development of MLSP-VQA approaches that outperform existing models in in-the-wild VQA tasks.

Findings

01

MLSP-VQA-FF achieves SRCC of 0.82 on KoNViD-1k.

02

MLSP-VQA-FF surpasses previous models in cross-dataset tests.

03

KonVid-150k enables better generalization for VQA models.

Abstract

Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep-features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.