Guiding Perception-Reasoning Closer to Human in Blind Image Quality Assessment

Yuan Li; Yahan Yu; Youyuan Lin; Yong-Hao Yang; Chenhui Chu; Shin'ya Nishida

arXiv:2512.16484·cs.CV·December 19, 2025

Guiding Perception-Reasoning Closer to Human in Blind Image Quality Assessment

Yuan Li, Yahan Yu, Youyuan Lin, Yong-Hao Yang, Chenhui Chu, Shin'ya Nishida

PDF

Open Access

TL;DR

This paper proposes a reinforcement learning approach guided by human annotations to develop a blind image quality assessment model that mimics human perception and reasoning, achieving comparable accuracy and improved interpretability.

Contribution

It introduces a human-guided reinforcement learning framework that enhances both the accuracy and interpretability of BIQA models by incorporating human perception-reasoning data.

Findings

01

Achieves state-of-the-art correlation metrics in BIQA.

02

Improves alignment with human reasoning chains as measured by ROUGE-1.

03

Demonstrates the model's ability to generate human-like explanations.

Abstract

Humans assess image quality through a perception-reasoning cascade, integrating sensory cues with implicit reasoning to form self-consistent judgments. In this work, we investigate how a model can acquire both human-like and self-consistent reasoning capability for blind image quality assessment (BIQA). We first collect human evaluation data that capture several aspects of human perception-reasoning pipeline. Then, we adopt reinforcement learning, using human annotations as reward signals to guide the model toward human-like perception and reasoning. To enable the model to internalize self-consistent reasoning capability, we design a reward that drives the model to infer the image quality purely from self-generated descriptions. Empirically, our approach achieves score prediction performance comparable to state-of-the-art BIQA systems under general metrics, including Pearson and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Image Enhancement Techniques