Easter2.0: Improving convolutional models for handwritten text recognition
Kartik Chaudhary, Raghav Bali

TL;DR
Easter2.0 is a CNN-based architecture for handwritten text recognition that achieves state-of-the-art results, incorporating novel modules and data augmentation techniques, and performs well in few-shot learning scenarios.
Contribution
The paper introduces Easter2.0, a CNN architecture with residual and Squeeze-and-Excitation modules, and a new data augmentation method TACO, improving handwritten text recognition performance.
Findings
Easter2.0 outperforms existing models on IAM dataset.
TACO augmentation enhances recognition accuracy.
Easter2.0 is effective for few-shot learning.
Abstract
Convolutional Neural Networks (CNN) have shown promising results for the task of Handwritten Text Recognition (HTR) but they still fall behind Recurrent Neural Networks (RNNs)/Transformer based models in terms of performance. In this paper, we propose a CNN based architecture that bridges this gap. Our work, Easter2.0, is composed of multiple layers of 1D Convolution, Batch Normalization, ReLU, Dropout, Dense Residual connection, Squeeze-and-Excitation module and make use of Connectionist Temporal Classification (CTC) loss. In addition to the Easter2.0 architecture, we propose a simple and effective data augmentation technique 'Tiling and Corruption (TACO)' relevant for the task of HTR/OCR. Our work achieves state-of-the-art results on IAM handwriting database when trained using only publicly available training data. In our experiments, we also present the impact of TACO augmentations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Image Processing and 3D Reconstruction
MethodsConvolution · Dropout · Batch Normalization
