RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution
Yushuai Song, Weize Quan, Weining Wang, Jiahui Sun, Jing Liu, Meng Li, Pengbin Yu, Zhentao Chen, Wei Shen, Lunxi Yuan, Dong-ming Yan

TL;DR
RefReward-SR introduces a novel LR-conditioned reward model leveraging multimodal language models to improve perceptual quality and semantic consistency in super-resolution, aligning better with human preferences.
Contribution
It proposes a new LR-conditioned reward framework for SR that uses multimodal language models and constructs a large-scale dataset for preference learning, advancing preference-aligned super-resolution.
Findings
Achieves better alignment with human perceptual judgments
Produces more semantically consistent and natural high-resolution images
Outperforms existing methods in perceptual quality metrics
Abstract
Recent advances in generative super-resolution (SR) have greatly improved visual realism, yet existing evaluation and optimization frameworks remain misaligned with human perception. Full-Reference and No-Reference metrics often fail to reflect perceptual preference, either penalizing semantically plausible details due to pixel misalignment or favoring visually sharp but inconsistent artifacts. Moreover, most SR methods rely on ground-truth (GT)-dependent distribution matching, which does not necessarily correspond to human judgments. In this work, we propose RefReward-SR, a low-resolution (LR) reference-aware reward model for preference-aligned SR. Instead of relying on GT supervision or NR evaluation, RefReward-SR assesses high-resolution (HR) reconstructions conditioned on their LR inputs, treating the LR image as a semantic anchor. Leveraging the visual-linguistic priors of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment
