A hypothesize-and-verify framework for Text Recognition using Deep   Recurrent Neural Networks

Anupama Ray; Sai Rajeswar; and Santanu Chaudhury

arXiv:1502.07540·cs.CV·February 27, 2015·1 cites

A hypothesize-and-verify framework for Text Recognition using Deep Recurrent Neural Networks

Anupama Ray, Sai Rajeswar, and Santanu Chaudhury

PDF

Open Access

TL;DR

This paper introduces a hypothesize-and-verify framework for text recognition that combines deep recurrent neural networks with a language model to improve accuracy by verifying multiple segmentation hypotheses.

Contribution

It presents a novel hybrid text recognition system that integrates multi-hypotheses segmentation with deep RNNs and language model verification to enhance recognition accuracy.

Findings

01

Improved recognition accuracy through hypothesis verification.

02

Effective elimination of segmentation errors.

03

Robust handling of long-range context in text recognition.

Abstract

Deep LSTM is an ideal candidate for text recognition. However text recognition involves some initial image processing steps like segmentation of lines and words which can induce error to the recognition system. Without segmentation, learning very long range context is difficult and becomes computationally intractable. Therefore, alternative soft decisions are needed at the pre-processing level. This paper proposes a hybrid text recognizer using a deep recurrent neural network with multiple layers of abstraction and long range context along with a language model to verify the performance of the deep neural network. In this paper we construct a multi-hypotheses tree architecture with candidate segments of line sequences from different segmentation algorithms at its different branches. The deep neural network is trained on perfectly segmented data and tests each of the candidate segments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Topic Modeling

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory