A Probabilistic Quality Representation Approach to Deep Blind Image Quality Prediction
Hui Zeng, Lei Zhang, Alan C. Bovik

TL;DR
This paper introduces a probabilistic quality representation for deep blind image quality assessment, improving training robustness and prediction accuracy by modeling subjective score distributions instead of scalar scores.
Contribution
It proposes a novel probabilistic quality representation (PQR) that captures subjective score distributions, enhancing deep BIQA model training and performance.
Findings
PQR speeds up model training convergence.
PQR significantly improves quality prediction accuracy.
The method outperforms scalar score regression approaches.
Abstract
Blind image quality assessment (BIQA) remains a very challenging problem due to the unavailability of a reference image. Deep learning based BIQA methods have been attracting increasing attention in recent years, yet it remains a difficult task to train a robust deep BIQA model because of the very limited number of training samples with human subjective scores. Most existing methods learn a regression network to minimize the prediction error of a scalar image quality score. However, such a scheme ignores the fact that an image will receive divergent subjective scores from different subjects, which cannot be adequately represented by a single scalar number. This is particularly true on complex, real-world distorted images. Moreover, images may broadly differ in their distributions of assigned subjective scores. Recognizing this, we propose a new representation of perceptual image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Fusion Techniques · Advanced Image Processing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
