PEAN: A Diffusion-Based Prior-Enhanced Attention Network for Scene Text Image Super-Resolution
Zuoyan Zhao, Hui Xue, Pengfei Fang, Shipeng Zhu

TL;DR
PEAN introduces a diffusion-based prior-enhanced attention network that significantly improves scene text image super-resolution by integrating local-global dependence understanding and semantic text priors, achieving state-of-the-art results.
Contribution
The paper proposes a novel PEAN model combining attention modulation and diffusion-based text priors for enhanced scene text image super-resolution.
Findings
PEAN achieves new SOTA on TextZoom benchmark.
The diffusion-based text prior significantly improves SR quality.
Multi-task learning enhances model performance.
Abstract
Scene text image super-resolution (STISR) aims at simultaneously increasing the resolution and readability of low-resolution scene text images, thus boosting the performance of the downstream recognition task. Two factors in scene text images, visual structure and semantic information, affect the recognition performance significantly. To mitigate the effects from these factors, this paper proposes a Prior-Enhanced Attention Network (PEAN). Specifically, an attention-based modulation module is leveraged to understand scene text images by neatly perceiving the local and global dependence of images, despite the shape of the text. Meanwhile, a diffusion-based module is developed to enhance the text prior, hence offering better guidance for the SR network to generate SR images with higher semantic accuracy. Additionally, a multi-task learning paradigm is employed to optimize the network,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Signal Denoising Methods · Advanced Image Fusion Techniques
