RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Xiaopeng Sun; Qinwei Lin; Yu Gao; Yujie Zhong; Chengjian Feng; Dengjie; Li; Zheng Zhao; Jie Hu; Lin Ma

arXiv:2412.03268·cs.CV·December 5, 2024

RFSR: Improving ISR Diffusion Models via Reward Feedback Learning

Xiaopeng Sun, Qinwei Lin, Yu Gao, Yujie Zhong, Chengjian Feng, Dengjie, Li, Zheng Zhao, Jie Hu, Lin Ma

PDF

Open Access 1 Repo

TL;DR

This paper introduces RFSR, a reward feedback learning approach that enhances image super-resolution diffusion models by applying a timestep-aware training strategy with regularization, leading to improved perceptual and aesthetic image quality.

Contribution

The paper proposes a novel reward feedback learning method with a timestep-aware strategy and regularization, which can be integrated into existing diffusion-based ISR models to improve image quality.

Findings

01

Significant improvement in perceptual quality of SR images.

02

Enhanced aesthetic appeal of generated images.

03

Method is plug-and-play and compatible with existing models.

Abstract

Generative diffusion models (DM) have been extensively utilized in image super-resolution (ISR). Most of the existing methods adopt the denoising loss from DDPMs for model optimization. We posit that introducing reward feedback learning to finetune the existing models can further improve the quality of the generated images. In this paper, we propose a timestep-aware training strategy with reward feedback learning. Specifically, in the initial denoising stages of ISR diffusion, we apply low-frequency constraints to super-resolution (SR) images to maintain structural stability. In the later denoising stages, we use reward feedback learning to improve the perceptual and aesthetic quality of the SR images. In addition, we incorporate Gram-KL regularization to alleviate stylization caused by reward hacking. Our method can be integrated into any diffusion-based ISR model in a plug-and-play…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sxpro/rfsr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems

MethodsADaptive gradient method with the OPTimal convergence rate · Diffusion