Q-Hawkeye: Reliable Visual Policy Optimization for Image Quality Assessment

Wulin Xie; Rui Dai; Ruidong Ding; Kaikui Liu; Xiangxiang Chu; Xinwen Hou; Jie Wen

arXiv:2601.22920 · cs.CV · February 17, 2026

TL;DR

Q-Hawkeye introduces a reliable RL-based framework for image quality assessment that uses uncertainty estimation and perception grounding to improve stability and accuracy over existing methods.

Contribution

The paper proposes an uncertainty-aware, perception-grounded optimization framework for RL-based IQA that targets two problems: unstable training signals and weak visual grounding.

Findings

01. Outperforms state-of-the-art IQA methods in experiments.
02. Demonstrates better generalization across multiple datasets.
03. Stabilizes policy optimization through uncertainty reweighting.

Abstract

Image Quality Assessment (IQA) predicts perceptual quality scores consistent with human judgments. Recent RL-based IQA methods built on MLLMs focus on generating visual quality descriptions and scores, ignoring two key reliability limitations: (i) although the model's prediction stability varies significantly across training samples, existing GRPO-based methods apply uniform advantage weighting, thereby amplifying noisy signals from unstable samples in gradient updates; (ii) most works emphasize text-grounded reasoning over images while overlooking the model's visual perception ability of image content. In this paper, we propose Q-Hawkeye, an RL-based reliable visual policy optimization framework that redesigns the learning signal through unified Uncertainty-Aware Dynamic Optimization and Perception-Aware Optimization. Q-Hawkeye estimates predictive uncertainty using the variance of…
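To make limitation (i) concrete, here is a minimal sketch of uncertainty-reweighted, group-relative advantages. It is not the authors' implementation. The abstract is cut off before it states what the variance is taken over or how it enters the update, so the sketch assumes the variance of the scores predicted across one sample's rollouts, and it uses a hypothetical exponential mapping with a hypothetical `temperature` parameter to turn that variance into a weight. All function names are illustrative.

```python
import math
from statistics import mean, pstdev, pvariance


def group_advantages(rewards, eps=1e-6):
    """GRPO-style advantages: standardize rewards within one rollout group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def uncertainty_weight(scores, temperature=1.0):
    """Hypothetical weight in (0, 1]: larger score variance gives a smaller weight.

    Assumption: uncertainty is the population variance of the quality scores
    predicted across the rollouts of a single training sample.
    """
    return math.exp(-pvariance(scores) / temperature)


def reweighted_advantages(rewards, scores, temperature=1.0):
    """Scale every advantage in the group by the sample's uncertainty weight."""
    weight = uncertainty_weight(scores, temperature)
    return [weight * a for a in group_advantages(rewards)]


# Two samples with identical rewards but different prediction stability.
rewards = [1.0, 0.0, 1.0, 0.0]
stable_scores = [3.0, 3.1, 2.9, 3.0]     # rollouts agree on the quality score
unstable_scores = [1.0, 4.5, 2.0, 5.0]   # rollouts disagree strongly

stable_adv = reweighted_advantages(rewards, stable_scores)
unstable_adv = reweighted_advantages(rewards, unstable_scores)
```

With uniform weighting, both samples would contribute advantages of the same magnitude. Under the assumed mapping, the unstable sample's advantages shrink toward zero, so its noisier signal has less influence on the gradient update.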

Taxonomy

Topics: Image and Video Quality Assessment · Visual Attention and Saliency Detection · Image Enhancement Techniques