Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution
Yuxuan Zhou, Liangcai Gao, Zhi Tang, Baole Wei

TL;DR
This paper introduces RGDiffSR, a diffusion-based super-resolution model guided by recognition cues, significantly improving the quality and recognition accuracy of low-resolution scene text images, especially in challenging cases.
Contribution
The paper proposes a novel recognition-guided diffusion model for scene text super-resolution, enhancing generative diversity and fidelity over previous CNN-based methods.
Findings
Outperforms state-of-the-art methods on TextZoom dataset
Achieves higher recognition accuracy in low-quality images
Demonstrates robustness in severely blurred scenarios
Abstract
Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR). Previous methods predominantly employ discriminative Convolutional Neural Networks (CNNs) augmented with diverse forms of text guidance to address this issue. Nevertheless, they remain deficient when confronted with severely blurred images, due to their insufficient generation capability when little structural or semantic information can be extracted from original images. Therefore, we introduce RGDiffSR, a Recognition-Guided Diffusion model for scene text image Super-Resolution, which exhibits great generative diversity and fidelity even in challenging scenarios. Moreover, we propose a Recognition-Guided Denoising Network, to guide the diffusion model generating LR-consistent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Image Processing Techniques and Applications
MethodsDiffusion
