Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data

Xinghua Lou; Ken Kansky; Wolfgang Lehrach; CC Laan; Bhaskara Marthi; D. Scott Phoenix; Dileep George

arXiv:1611.02788 · cs.CV · November 10, 2016 · 5 cites

TL;DR

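This paper introduces a generative shape model for scene text recognition and character-level instance segmentation. It reaches state-of-the-art accuracy with orders of magnitude fewer training images than competing discriminative methods, and it is more robust to affine transformations and non-affine deformations than previous approaches.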

Contribution

The paper presents a generative shape model that jointly recognizes and segments scene text, matching state-of-the-art results while needing far fewer training images than competing discriminative methods.

Findings

01. Achieves state-of-the-art scene text recognition results

02. Requires orders of magnitude fewer training images than competing discriminative methods

03. More robust to affine transformations and non-affine deformations than previous approaches

Abstract

We demonstrate that a generative model for object shapes can achieve state of the art results on challenging scene text recognition tasks, and with orders of magnitude fewer training images than required for competing discriminative methods. In addition to transcribing text from challenging images, our method performs fine-grained instance segmentation of characters. We show that our model is more robust to both affine transformations and non-affine deformations compared to previous approaches.

Taxonomy

Topics: Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques