RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution

Haodong He; Yancheng Bai; Rui Lan; Xu Duan; Lei Sun; Xiangxiang Chu; and Gui-Song Xia

arXiv:2508.16158·cs.CV·August 25, 2025

RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution

Haodong He, Yancheng Bai, Rui Lan, Xu Duan, Lei Sun, Xiangxiang Chu, and Gui-Song Xia

PDF

TL;DR

RAGSR introduces a regional attention mechanism that enhances image super-resolution by explicitly incorporating localized textual descriptions, leading to more detailed and contextually consistent high-resolution images.

Contribution

The paper proposes a novel regional attention guided super-resolution method that explicitly encodes localized textual information to improve detail and accuracy in SR results.

Findings

01

Outperforms existing methods in perceptual quality and detail preservation

02

Effectively localizes object regions and encodes fine-grained descriptions

03

Achieves superior results on benchmark datasets

Abstract

The rich textual information of large vision-language models (VLMs) combined with the powerful generative prior of pre-trained text-to-image (T2I) diffusion models has achieved impressive performance in single-image super-resolution (SISR). However, existing methods still face significant challenges in generating clear and accurate regional details, particularly in scenarios involving multiple objects. This challenge primarily stems from a lack of fine-grained regional descriptions and the models' insufficient ability to capture complex prompts. To address these limitations, we propose a Regional Attention Guided Super-Resolution (RAGSR) method that explicitly extracts localized fine-grained information and effectively encodes it through a novel regional attention mechanism, enabling both enhanced detail and overall visually coherent SR results. Specifically, RAGSR localizes object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.