Multi-Granularity Reasoning for Image Quality Assessment via Attribute-Aware Reinforcement Learning to Rank

Xiangyong Chen; Xiaochuan Lin; Haoran Liu; Xuan Li; Yichen Su; Xiangwei Guo

arXiv:2604.09704·cs.CV·April 14, 2026

Multi-Granularity Reasoning for Image Quality Assessment via Attribute-Aware Reinforcement Learning to Rank

Xiangyong Chen, Xiaochuan Lin, Haoran Liu, Xuan Li, Yichen Su, Xiangwei Guo

PDF

TL;DR

This paper introduces MG-IQA, a multi-granularity reasoning framework using reinforcement learning to assess overall and attribute-specific image quality simultaneously, improving accuracy and interpretability.

Contribution

It extends RL-based image quality assessment to multi-attribute evaluation with a novel prompting, reward, and training mechanism, enabling joint assessment across datasets.

Findings

01

Outperforms state-of-the-art in overall quality prediction with 2.1% SRCC improvement.

02

Achieves superior attribute-level assessment accuracy.

03

Provides interpretable, human-aligned quality descriptions.

Abstract

Recent advances in reasoning-induced image quality assessment (IQA) have demonstrated the power of reinforcement learning to rank (RL2R) for training vision-language models (VLMs) to assess perceptual quality. However, existing approaches operate at a single granularity, predicting only an overall quality score, while overlooking the multi-dimensional nature of human quality perception, which encompasses attributes such as sharpness, color fidelity, noise level, and compositional aesthetics. In this paper, we propose MG-IQA (Multi-Granularity IQA), a multi-granularity reasoning framework that extends RL2R to jointly assess overall image quality and fine-grained quality attributes within a single inference pass. Our approach introduces three key innovations: (1) an attribute-aware prompting strategy that elicits structured multi-attribute reasoning from VLMs; (2) a multi-dimensional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.