A DNN Framework For Text Image Rectification From Planar Transformations
Chengzhe Yan, Jie Hu, Changshui Zhang

TL;DR
This paper introduces a neural network framework designed to correct distorted text images caused by planar transformations, demonstrating robustness and effectiveness in image rectification without explicit segmentation supervision.
Contribution
The paper presents a novel DNN architecture for text image rectification and provides a new dataset for evaluating such models.
Findings
The model can learn geometric transformations without explicit segmentation labels.
The proposed architecture effectively restores planar transformations.
The new dataset supports further research in text image rectification.
Abstract
In this paper, a novel neural network architecture is proposed attempting to rectify text images with mild assumptions. A new dataset of text images is collected to verify our model and open to public. We explored the capability of deep neural network in learning geometric transformation and found the model could segment the text image without explicit supervised segmentation information. Experiments show the architecture proposed can restore planar transformations with wonderful robustness and effectiveness.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Generative Adversarial Networks and Image Synthesis
