VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank

Tianhe Wu; Jian Zou; Jie Liang; Lei Zhang; Kede Ma

arXiv:2505.14460 · cs.CV · October 22, 2025

TL;DR

This paper introduces VisualQuality-R1, a no-reference image quality assessment (NR-IQA) model trained with reinforcement learning to rank. It reasons about visual quality, outperforms existing NR-IQA models, produces human-aligned quality descriptions, and can be trained on multiple datasets.

Contribution

We propose VisualQuality-R1, a reasoning-induced NR-IQA model trained with reinforcement learning to rank. It generates human-aligned quality descriptions and supports multi-dataset training.

Findings

01. Outperforms existing NR-IQA models in accuracy.
02. Generates human-aligned, context-rich quality descriptions.
03. Supports multi-dataset training without perceptual scale realignment.

Abstract

DeepSeek-R1 has demonstrated remarkable effectiveness in incentivizing reasoning and generalization capabilities of large language models (LLMs) through reinforcement learning. Nevertheless, the potential of reasoning-induced computation has not been thoroughly explored in the context of image quality assessment (IQA), a task depending critically on visual reasoning. In this paper, we introduce VisualQuality-R1, a reasoning-induced no-reference IQA (NR-IQA) model, and we train it with reinforcement learning to rank, a learning algorithm tailored to the intrinsically relative nature of visual quality. Specifically, for a pair of images, we employ group relative policy optimization to generate multiple quality scores for each image. These estimates are used to compute comparative probabilities of one image having higher quality than the other under the Thurstone model. Rewards for each…
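The comparison step mentioned in the abstract (several sampled quality scores per image, turned into a probability that one image is better than the other under the Thurstone model) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of the sample mean and variance of each image's scores, and the `eps` stabilizer are assumptions made for illustration only, and the reward computation itself is not shown because the abstract above is truncated at that point.

```python
import math
from statistics import mean, pvariance


def normal_cdf(z: float) -> float:
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def thurstone_preference(scores_a, scores_b, eps: float = 1e-6) -> float:
    """Probability that image A has higher quality than image B.

    Hypothetical helper: each image's quality is treated as Gaussian, with
    mean and variance estimated from its sampled scores (an assumption).
    """
    mu_a, mu_b = mean(scores_a), mean(scores_b)
    var_a, var_b = pvariance(scores_a), pvariance(scores_b)
    # Thurstone model: the quality difference is Gaussian, so the preference
    # probability is the normal CDF of the standardized mean difference.
    z = (mu_a - mu_b) / math.sqrt(var_a + var_b + eps)
    return normal_cdf(z)


if __name__ == "__main__":
    # Made-up scores standing in for multiple sampled responses per image.
    p = thurstone_preference([4.0, 4.2, 3.8], [2.0, 2.1, 1.9])
    print(f"P(A better than B) = {p:.4f}")
```

Because the output depends only on score differences scaled by their spread, it is a relative quantity, which is consistent with the abstract's framing of visual quality as intrinsically relative.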

Code & Models

Repositories

tianhewu/visualquality-r1 (PyTorch, official)

Models

🤗 TianheWu/VisualQuality-R1-7B
model · 1.7k dl · ♡ 10

Taxonomy

Topics: Visual Attention and Saliency Detection