Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

Xudong Li; Zihao Huang; Yan Zhang; Yunhang Shen; Ke Li; Xiawu Zheng; Liujuan Cao; Rongrong Ji

arXiv:2409.05381·cs.CV·September 23, 2025

Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models

Xudong Li, Zihao Huang, Yan Zhang, Yunhang Shen, Ke Li, Xiawu Zheng, Liujuan Cao, Rongrong Ji

PDF

Open Access 1 Models

TL;DR

This paper introduces GRMP-IQA, a novel framework that adapts vision-language models for image quality assessment with limited data, using meta-learning and gradient regularization to achieve high accuracy.

Contribution

The paper proposes a new meta-prompt based IQA framework that efficiently adapts CLIP to IQA tasks with limited data, outperforming existing methods.

Findings

01

GRMP-IQA achieves competitive performance with only 20% of training data.

02

The framework outperforms state-of-the-art BIQA methods under limited data conditions.

03

Meta-learning and gradient regularization enhance model adaptation and prevent overfitting.

Abstract

Image Quality Assessment (IQA) remains an unresolved challenge in computer vision due to complex distortions, diverse image content, and limited data availability. Existing Blind IQA (BIQA) methods largely rely on extensive human annotations, which are labor-intensive and costly due to the demanding nature of creating IQA datasets. To reduce this dependency, we propose the Gradient-Regulated Meta-Prompt IQA Framework (GRMP-IQA), designed to efficiently adapt the visual-language pre-trained model, CLIP, to IQA tasks, achieving high accuracy even with limited data. GRMP-IQA consists of two core modules: (i) Meta-Prompt Pre-training Module and (ii) Quality-Aware Gradient Regularization. The Meta Prompt Pre-training Module leverages a meta-learning paradigm to pre-train soft prompts with shared meta-knowledge across different distortions, enabling rapid adaptation to various IQA tasks. On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
zzhowe/GRMP-IQA
model· ♡ 2
♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Image and Video Quality Assessment · Image and Signal Denoising Methods

MethodsSoftmax · Attention Is All You Need · Contrastive Language-Image Pre-training