RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution

Yushuai Song; Weize Quan; Weining Wang; Jiahui Sun; Jing Liu; Meng Li; Pengbin Yu; Zhentao Chen; Wei Shen; Lunxi Yuan; Dong-ming Yan

arXiv:2603.24198·cs.CV·March 26, 2026

RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution

Yushuai Song, Weize Quan, Weining Wang, Jiahui Sun, Jing Liu, Meng Li, Pengbin Yu, Zhentao Chen, Wei Shen, Lunxi Yuan, Dong-ming Yan

PDF

Open Access

TL;DR

RefReward-SR introduces a novel LR-conditioned reward model leveraging multimodal language models to improve perceptual quality and semantic consistency in super-resolution, aligning better with human preferences.

Contribution

It proposes a new LR-conditioned reward framework for SR that uses multimodal language models and constructs a large-scale dataset for preference learning, advancing preference-aligned super-resolution.

Findings

01

Achieves better alignment with human perceptual judgments

02

Produces more semantically consistent and natural high-resolution images

03

Outperforms existing methods in perceptual quality metrics

Abstract

Recent advances in generative super-resolution (SR) have greatly improved visual realism, yet existing evaluation and optimization frameworks remain misaligned with human perception. Full-Reference and No-Reference metrics often fail to reflect perceptual preference, either penalizing semantically plausible details due to pixel misalignment or favoring visually sharp but inconsistent artifacts. Moreover, most SR methods rely on ground-truth (GT)-dependent distribution matching, which does not necessarily correspond to human judgments. In this work, we propose RefReward-SR, a low-resolution (LR) reference-aware reward model for preference-aligned SR. Instead of relying on GT supervision or NR evaluation, RefReward-SR assesses high-resolution (HR) reconstructions conditioned on their LR inputs, treating the LR image as a semantic anchor. Leveraging the visual-linguistic priors of a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment