Learning Generative Structure Prior for Blind Text Image Super-resolution
Xiaoming Li, Wangmeng Zuo, Chen Change Loy

TL;DR
This paper introduces a novel generative structure prior using StyleGAN and a codebook to improve blind text image super-resolution, especially for complex characters like Chinese, by focusing on character structures rather than recognition.
Contribution
The work proposes a structure prior based on StyleGAN and a codebook to better capture character details, enhancing super-resolution for diverse and complex text images.
Findings
Outperforms existing methods on synthetic datasets
Improves fidelity of complex character strokes
Demonstrates robustness on real-world data
Abstract
Blind text image super-resolution (SR) is challenging as one needs to cope with diverse font styles and unknown degradation. To address the problem, existing methods perform character recognition in parallel to regularize the SR task, either through a loss constraint or intermediate feature condition. Nonetheless, the high-level prior could still fail when encountering severe degradation. The problem is further compounded given characters of complex structures, e.g., Chinese characters that combine multiple pictographic or ideographic symbols into a single character. In this work, we present a novel prior that focuses more on the character structure. In particular, we learn to encapsulate rich and diverse structures in a StyleGAN and exploit such generative structure priors for restoration. To restrict the generative space of StyleGAN so that it obeys the structure of characters yet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Digital Media Forensic Detection
Methodsfail · Dense Connections · Convolution · Feedforward Network · R1 Regularization · Adaptive Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · StyleGAN
