TEXTS-Diff: TEXTS-Aware Diffusion Model for Real-World Text Image Super-Resolution
Haodong He, Xin Zhan, Yancheng Bai, Rui Lan, Lei Sun, Xiangxiang Chu

TL;DR
TEXTS-Diff introduces a novel diffusion-based approach and a large-scale real-world dataset to significantly improve text image super-resolution, restoring both background and textual details with high fidelity.
Contribution
The paper presents a new TEXTS-Aware Diffusion Model and the Real-Texts dataset, addressing data scarcity and improving text and background restoration in real-world scenarios.
Findings
Achieves state-of-the-art super-resolution performance.
Demonstrates superior generalization in complex scenes.
Effectively reduces distortions and hallucinations in text regions.
Abstract
Real-world text image super-resolution aims to restore overall visual quality and text legibility in images suffering from diverse degradations and text distortions. However, the scarcity of text image data in existing datasets results in poor performance on text regions. In addition, datasets consisting of isolated text samples limit the quality of background reconstruction. To address these limitations, we construct Real-Texts, a large-scale, high-quality dataset collected from real-world images, which covers diverse scenarios and contains natural text instances in both Chinese and English. Additionally, we propose the TEXTS-Aware Diffusion Model (TEXTS-Diff) to achieve high-quality generation in both background and textual regions. This approach leverages abstract concepts to improve the understanding of textual elements within visual scenes and concrete text regions to enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis · Image and Video Quality Assessment
