Guiding Perception-Reasoning Closer to Human in Blind Image Quality Assessment
Yuan Li, Yahan Yu, Youyuan Lin, Yong-Hao Yang, Chenhui Chu, Shin'ya Nishida

TL;DR
This paper proposes a reinforcement learning approach guided by human annotations to develop a blind image quality assessment model that mimics human perception and reasoning, achieving comparable accuracy and improved interpretability.
Contribution
It introduces a human-guided reinforcement learning framework that enhances both the accuracy and interpretability of BIQA models by incorporating human perception-reasoning data.
Findings
Achieves state-of-the-art correlation metrics in BIQA.
Improves alignment with human reasoning chains as measured by ROUGE-1.
Demonstrates the model's ability to generate human-like explanations.
Abstract
Humans assess image quality through a perception-reasoning cascade, integrating sensory cues with implicit reasoning to form self-consistent judgments. In this work, we investigate how a model can acquire both human-like and self-consistent reasoning capability for blind image quality assessment (BIQA). We first collect human evaluation data that capture several aspects of human perception-reasoning pipeline. Then, we adopt reinforcement learning, using human annotations as reward signals to guide the model toward human-like perception and reasoning. To enable the model to internalize self-consistent reasoning capability, we design a reward that drives the model to infer the image quality purely from self-generated descriptions. Empirically, our approach achieves score prediction performance comparable to state-of-the-art BIQA systems under general metrics, including Pearson and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Visual Attention and Saliency Detection · Image Enhancement Techniques
