A hypothesize-and-verify framework for Text Recognition using Deep Recurrent Neural Networks
Anupama Ray, Sai Rajeswar, and Santanu Chaudhury

TL;DR
This paper introduces a hypothesize-and-verify framework for text recognition that combines deep recurrent neural networks with a language model to improve accuracy by verifying multiple segmentation hypotheses.
Contribution
It presents a novel hybrid text recognition system that integrates multi-hypotheses segmentation with deep RNNs and language model verification to enhance recognition accuracy.
Findings
Improved recognition accuracy through hypothesis verification.
Effective elimination of segmentation errors.
Robust handling of long-range context in text recognition.
Abstract
Deep LSTM is an ideal candidate for text recognition. However text recognition involves some initial image processing steps like segmentation of lines and words which can induce error to the recognition system. Without segmentation, learning very long range context is difficult and becomes computationally intractable. Therefore, alternative soft decisions are needed at the pre-processing level. This paper proposes a hybrid text recognizer using a deep recurrent neural network with multiple layers of abstraction and long range context along with a language model to verify the performance of the deep neural network. In this paper we construct a multi-hypotheses tree architecture with candidate segments of line sequences from different segmentation algorithms at its different branches. The deep neural network is trained on perfectly segmented data and tests each of the candidate segments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Topic Modeling
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
