A DNN Framework For Text Image Rectification From Planar Transformations

Chengzhe Yan; Jie Hu; Changshui Zhang

arXiv:1611.04298 · cs.CV · November 15, 2016

Open Access

TL;DR

This paper introduces a neural network framework that rectifies text images distorted by planar transformations. The authors report that it is robust and effective, and that it learns to segment text without explicit segmentation supervision.

Contribution

The paper presents a novel DNN architecture for text image rectification and provides a new dataset for evaluating such models.

Findings

01

The model can learn geometric transformations without explicit segmentation labels.

02

The proposed architecture effectively rectifies text images distorted by planar transformations.

03

The new dataset supports further research in text image rectification.

Abstract

In this paper, a novel neural network architecture is proposed that attempts to rectify text images under mild assumptions. A new dataset of text images is collected to verify our model and is open to the public. We explore the capability of deep neural networks to learn geometric transformations and find that the model can segment the text image without explicit supervised segmentation information. Experiments show that the proposed architecture can restore images from planar transformations with high robustness and effectiveness.
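
For readers unfamiliar with the term, a planar transformation can be modeled as a 3x3 homography acting on image coordinates, and rectification amounts to applying its inverse. The sketch below illustrates only that geometric idea with NumPy. It is not the paper's network, which has to estimate the transformation from the image itself. The matrix `H_distort`, the helper `apply_homography`, and the text-box corners are hypothetical values chosen for illustration.

```python
import numpy as np

def apply_homography(H, points):
    """Map an (N, 2) array of 2D points through a 3x3 homography H."""
    pts = np.asarray(points, dtype=float)
    homogeneous = np.hstack([pts, np.ones((pts.shape[0], 1))])  # shape (N, 3)
    mapped = homogeneous @ H.T
    # Divide by the projective coordinate to return to 2D.
    return mapped[:, :2] / mapped[:, 2:3]

# Hypothetical distortion (assumption): mild rotation, shear, translation,
# and small perspective terms in the last row.
H_distort = np.array([
    [0.9, 0.2, 5.0],
    [-0.1, 1.1, 3.0],
    [1e-3, 2e-3, 1.0],
])

# Corners of an axis-aligned text box, in pixels (illustrative values).
corners = np.array([[0.0, 0.0], [100.0, 0.0], [100.0, 30.0], [0.0, 30.0]])

distorted = apply_homography(H_distort, corners)

# Rectification with a known transformation: apply the inverse homography.
rectified = apply_homography(np.linalg.inv(H_distort), distorted)
```

Here the transformation is known, so inverting it recovers the original corners up to floating-point error. In the setting the abstract describes, the transformation is unknown and must be inferred from the distorted image.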

Taxonomy

Topics: Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Generative Adversarial Networks and Image Synthesis