Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks
Mostafa Karimi, Gopalkrishna Veni, Yen-Yun Yu

TL;DR
This paper introduces HW2MP-GAN, a novel image-to-image translation model using conditional Sliced Wasserstein adversarial networks to convert illegible handwritten text images into clear machine-print images, improving recognition accuracy.
Contribution
The paper proposes a new GAN architecture incorporating SWD and U-Net for high-quality handwritten-to-machine-print image translation, outperforming existing models.
Findings
Outperforms baseline cGAN models by 30 in FHD
Achieves 0.6 lower Levenshtein distance
Improves word accuracy by 39% on IAM database
Abstract
Automatic text recognition from ancient handwritten record images is an important problem in the genealogy domain. However, critical challenges such as varying noise conditions, vanishing texts, and variations in handwriting make the recognition task difficult. We tackle this problem by developing a handwritten-to-machine-print conditional Generative Adversarial network (HW2MP-GAN) model that formulates handwritten recognition as a text-Image-to-text-Image translation problem where a given image, typically in an illegible form, is converted into another image, close to its machine-print form. The proposed model consists of three-components including a generator, and word-level and character-level discriminators. The model incorporates Sliced Wasserstein distance (SWD) and U-Net architectures in HW2MP-GAN for better quality image-to-image transformation. Our experiments reveal that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Generative Adversarial Networks and Image Synthesis
MethodsConcatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · U-Net
