Efficient approach of using CNN based pretrained model in Bangla handwritten digit recognition
Muntarin Islam, Shabbir Ahmed Shuvo, Musarrat Saberin Nipun, Rejwan, Bin Sulaiman, Jannatul Nayeem, Zubaer Haque, Md Mostak Shaikh, Md Sakib Ullah, Sourav

TL;DR
This paper introduces a CNN-based pre-trained model combining Resnet-50, Inception-v3, and EfficientNetB0 for Bangla handwritten digit recognition, achieving 97% accuracy on the NumtaDB dataset, surpassing previous models.
Contribution
It presents a novel ensemble CNN approach using pre-trained models specifically tailored for Bangla digit recognition, addressing the complexity of Bengali script.
Findings
Achieved 97% accuracy on NumtaDB dataset
Outperformed existing models in Bangla digit recognition
Validated the model against prior research results
Abstract
Due to digitalization in everyday life, the need for automatically recognizing handwritten digits is increasing. Handwritten digit recognition is essential for numerous applications in various industries. Bengali ranks the fifth largest language in the world with 265 million speakers (Native and non-native combined) and 4 percent of the world population speaks Bengali. Due to the complexity of Bengali writing in terms of variety in shape, size, and writing style, researchers did not get better accuracy using Supervised machine learning algorithms to date. Moreover, fewer studies have been done on Bangla handwritten digit recognition (BHwDR). In this paper, we proposed a novel CNN-based pre-trained handwritten digit recognition model which includes Resnet-50, Inception-v3, and EfficientNetB0 on NumtaDB dataset of 17 thousand instances with 10 classes.. The Result outperformed the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Hand Gesture Recognition Systems
MethodsSoftmax · Dropout · Convolution · Dense Connections · Max Pooling · 1x1 Convolution · Auxiliary Classifier · Label Smoothing · Average Pooling · Inception-v3 Module
