Image Pre-processing on NumtaDB for Bengali Handwritten Digit   Recognition

Ovi Paul

arXiv:2008.07853·cs.CV·August 19, 2020

Image Pre-processing on NumtaDB for Bengali Handwritten Digit Recognition

Ovi Paul

PDF

TL;DR

This paper aims to establish effective image pre-processing benchmarks for Bengali handwritten digit recognition using NumtaDB, addressing the lack of pre-processed data for improved machine learning model accuracy.

Contribution

It introduces pre-processing benchmarks for Bengali digits in NumtaDB, facilitating better model performance and providing a foundation for future research in Bengali handwritten digit recognition.

Findings

01

Identified effective pre-processing techniques for Bengali digits

02

Established benchmark accuracy levels for various models

03

Enhanced recognition performance on the NumtaDB dataset

Abstract

NumtaDB is by far the largest data-set collection for handwritten digits in Bengali. This is a diverse dataset containing more than 85000 images. But this diversity also makes this dataset very difficult to work with. The goal of this paper is to find the benchmark for pre-processed images which gives good accuracy on any machine learning models. The reason being, there are no available pre-processed data for Bengali digit recognition to work with like the English digits for MNIST.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.