Multi-Granularity Reasoning for Image Quality Assessment via Attribute-Aware Reinforcement Learning to Rank
Xiangyong Chen, Xiaochuan Lin, Haoran Liu, Xuan Li, Yichen Su, Xiangwei Guo

TL;DR
This paper introduces MG-IQA, a multi-granularity reasoning framework using reinforcement learning to assess overall and attribute-specific image quality simultaneously, improving accuracy and interpretability.
Contribution
It extends RL-based image quality assessment to multi-attribute evaluation with a novel prompting, reward, and training mechanism, enabling joint assessment across datasets.
Findings
Outperforms state-of-the-art in overall quality prediction with 2.1% SRCC improvement.
Achieves superior attribute-level assessment accuracy.
Provides interpretable, human-aligned quality descriptions.
Abstract
Recent advances in reasoning-induced image quality assessment (IQA) have demonstrated the power of reinforcement learning to rank (RL2R) for training vision-language models (VLMs) to assess perceptual quality. However, existing approaches operate at a single granularity, predicting only an overall quality score, while overlooking the multi-dimensional nature of human quality perception, which encompasses attributes such as sharpness, color fidelity, noise level, and compositional aesthetics. In this paper, we propose MG-IQA (Multi-Granularity IQA), a multi-granularity reasoning framework that extends RL2R to jointly assess overall image quality and fine-grained quality attributes within a single inference pass. Our approach introduces three key innovations: (1) an attribute-aware prompting strategy that elicits structured multi-attribute reasoning from VLMs; (2) a multi-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
