GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System
Lalita Kumari, Sukhdeep Singh, Vaibhav Varish Singh Rathore, Anuj, Sharma

TL;DR
GatedLexiconNet is an end-to-end handwritten paragraph recognition system that uses gated convolutional layers and attention-based line segmentation to improve accuracy over existing methods.
Contribution
It introduces gated convolutional layers and an attention mechanism for internal line segmentation in an end-to-end recognition framework, extending the existing LexiconNet model.
Findings
Achieved character error rates of 2.27% on IAM, 0.9% on RIMES, and 2.13% on READ-16 datasets.
Achieved word error rates of 5.73% on IAM, 2.76% on RIMES, and 6.52% on READ-16 datasets.
Demonstrated improved recognition performance over previous models.
Abstract
The Handwritten Text Recognition problem has been a challenge for researchers for the last few decades, especially in the domain of computer vision, a subdomain of pattern recognition. Variability of texts amongst writers, cursiveness, and different font styles of handwritten texts with degradation of historical text images make it a challenging problem. Recognizing scanned document images in neural network-based systems typically involves a two-step approach: segmentation and recognition. However, this method has several drawbacks. These shortcomings encompass challenges in identifying text regions, analyzing layout diversity within pages, and establishing accurate ground truth segmentation. Consequently, these processes are prone to errors, leading to bottlenecks in achieving high recognition accuracies. Thus, in this study, we present an end-to-end paragraph recognition system that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Text and Document Classification Technologies · Natural Language Processing Techniques
