Learning Generalizable Perceptual Representations for Data-Efficient   No-Reference Image Quality Assessment

Suhas Srinath; Shankhanil Mitra; Shika Rao; Rajiv Soundararajan

arXiv:2312.04838·cs.CV·December 11, 2023·1 cites

Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment

Suhas Srinath, Shankhanil Mitra, Shika Rao, Rajiv Soundararajan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach for no-reference image quality assessment that combines unsupervised low-level feature learning with high-level vision-language model fine-tuning, enabling data-efficient and zero-shot quality prediction across diverse distortions.

Contribution

It proposes a new contrastive loss for distortion-agnostic feature learning and leverages vision-language models for high-level quality assessment, enhancing generalizability and reducing annotation needs.

Findings

01

Outperforms existing methods on diverse datasets.

02

Effective in data-efficient and zero-shot scenarios.

03

Combines low-level and high-level features for improved accuracy.

Abstract

No-reference (NR) image quality assessment (IQA) is an important tool in enhancing the user experience in diverse visual applications. A major drawback of state-of-the-art NR-IQA techniques is their reliance on a large number of human annotations to train models for a target IQA application. To mitigate this requirement, there is a need for unsupervised learning of generalizable quality representations that capture diverse distortions. We enable the learning of low-level quality features agnostic to distortion types by introducing a novel quality-aware contrastive loss. Further, we leverage the generalizability of vision-language models by fine-tuning one such model to extract high-level image quality information through relevant text prompts. The two sets of features are combined to effectively predict quality by training a simple regressor with very few samples on a target dataset.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

suhas-srinath/grepq
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Advanced Image Fusion Techniques