Evaluating the Stability of Deep Image Quality Assessment With Respect to Image Scaling
Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa

TL;DR
This paper investigates how image scaling affects the performance of deep neural network-based image quality assessment methods, revealing that the choice of image scale significantly impacts their accuracy and stability.
Contribution
It provides a comprehensive evaluation of four deep IQAs across multiple datasets, highlighting the importance of image scale selection for optimal performance.
Findings
Image scale significantly influences deep IQA performance.
PieAPP is the most stable deep IQA among those tested.
Optimal image scale varies depending on the method and dataset.
Abstract
Image quality assessment (IQA) is a fundamental metric for image processing tasks (e.g., compression). With full-reference IQAs, traditional IQAs, such as PSNR and SSIM, have been used. Recently, IQAs based on deep neural networks (deep IQAs), such as LPIPS and DISTS, have also been used. It is known that image scaling is inconsistent among deep IQAs, as some perform down-scaling as pre-processing, whereas others instead use the original image size. In this paper, we show that the image scale is an influential factor that affects deep IQA performance. We comprehensively evaluate four deep IQAs on the same five datasets, and the experimental results show that image scale significantly influences IQA performance. We found that the most appropriate image scale is often neither the default nor the original size, and the choice differs depending on the methods and datasets used. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Advanced Image Fusion Techniques · Image and Signal Denoising Methods
