Recurrent neural networks based Indic word-wise script identification using character-wise training
Rohun Tripathi, Aman Gill, Riccha Tripati

TL;DR
This paper introduces a novel RNN-based method for Indic script recognition that effectively uses character-level online data to predict word-level scripts, reducing data requirements and training time.
Contribution
It proposes a character-wise training approach with BLSTM RNNs for script identification, extending online models to offline data through stroke recovery.
Findings
Performance comparable to word-level trained models
Reduced data and training time for character-based models
Effective offline script prediction via stroke recovery
Abstract
This paper presents a novel methodology of Indic handwritten script recognition using Recurrent Neural Networks and addresses the problem of script recognition in poor data scenarios, such as when only character level online data is available. It is based on the hypothesis that curves of online character data comprise sufficient information for prediction at the word level. Online character data is used to train RNNs using BLSTM architecture which are then used to make predictions of online word level data. These prediction results on the test set are at par with prediction results of models trained with online word data, while the training of the character level model is much less data intensive and takes much less time. Performance for binary-script models and then 5 Indic script models are reported, along with comparison with HMM models.The system is extended for offline data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Vehicle License Plate Recognition
