TL;DR
This paper explores deep learning approaches for handwritten recognition of Cyrillic text in Kazakh and Russian, introducing new models and a novel dataset of handwritten Cyrillic words for improved recognition accuracy.
Contribution
The authors propose new deep learning models and create a new Cyrillic handwritten dataset for Kazakh and Russian languages, addressing data scarcity in this area.
Findings
Deep CNNs combined with MLP improve recognition accuracy.
CNN-RNN hybrid models outperform traditional approaches.
New dataset enhances training and evaluation of Cyrillic handwriting recognition.
Abstract
This article discusses the problem of handwriting recognition in Kazakh and Russian languages. This area is poorly studied since in the literature there are almost no works in this direction. We have tried to describe various approaches and achievements of recent years in the development of handwritten recognition models in relation to Cyrillic graphics. The first model uses deep convolutional neural networks (CNNs) for feature extraction and a fully connected multilayer perceptron neural network (MLP) for word classification. The second model, called SimpleHTR, uses CNN and recurrent neural network (RNN) layers to extract information from images. We also proposed the Bluechet and Puchserver models to compare the results. Due to the lack of available open datasets in Russian and Kazakh languages, we carried out work to collect data that included handwritten names of countries and cities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
