Learning Generative Structure Prior for Blind Text Image   Super-resolution

Xiaoming Li; Wangmeng Zuo; Chen Change Loy

arXiv:2303.14726·cs.CV·March 28, 2023·1 cites

Learning Generative Structure Prior for Blind Text Image Super-resolution

Xiaoming Li, Wangmeng Zuo, Chen Change Loy

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel generative structure prior using StyleGAN and a codebook to improve blind text image super-resolution, especially for complex characters like Chinese, by focusing on character structures rather than recognition.

Contribution

The work proposes a structure prior based on StyleGAN and a codebook to better capture character details, enhancing super-resolution for diverse and complex text images.

Findings

01

Outperforms existing methods on synthetic datasets

02

Improves fidelity of complex character strokes

03

Demonstrates robustness on real-world data

Abstract

Blind text image super-resolution (SR) is challenging as one needs to cope with diverse font styles and unknown degradation. To address the problem, existing methods perform character recognition in parallel to regularize the SR task, either through a loss constraint or intermediate feature condition. Nonetheless, the high-level prior could still fail when encountering severe degradation. The problem is further compounded given characters of complex structures, e.g., Chinese characters that combine multiple pictographic or ideographic symbols into a single character. In this work, we present a novel prior that focuses more on the character structure. In particular, we learn to encapsulate rich and diverse structures in a StyleGAN and exploit such generative structure priors for restoration. To restrict the generative space of StyleGAN so that it obeys the structure of characters yet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

csxmli2016/marconet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image Processing Techniques and Applications · Digital Media Forensic Detection

Methodsfail · Dense Connections · Convolution · Feedforward Network · R1 Regularization · Adaptive Instance Normalization · HuMan(Expedia)||How do I get a human at Expedia? · StyleGAN