A Hybrid Deep Learning Model for Arabic Text Recognition
Mohammad Fasha, Bassam Hammo, Nadim Obeid, Jabir Widian

TL;DR
This paper introduces a hybrid deep learning model capable of recognizing printed Arabic text across various fonts, including handwritten styles, without character segmentation, tested on a large diverse dataset.
Contribution
The paper presents a novel hybrid deep learning architecture for Arabic text recognition that handles multiple font types and does not require character segmentation.
Findings
Achieved high accuracy in recognizing Arabic characters and words.
Demonstrated robustness on unseen font data.
Provided publicly available datasets and tools for further research.
Abstract
Arabic text recognition is a challenging task because of the cursive nature of Arabic writing system, its joint writing scheme, the large number of ligatures and many other challenges. Deep Learning DL models achieved significant progress in numerous domains including computer vision and sequence modelling. This paper presents a model that can recognize Arabic text that was printed using multiple font types including fonts that mimic Arabic handwritten scripts. The proposed model employs a hybrid DL network that can recognize Arabic printed text without the need for character segmentation. The model was tested on a custom dataset comprised of over two million word samples that were generated using 18 different Arabic font types. The objective of the testing process was to assess the model capability in recognizing a diverse set of Arabic fonts representing a varied cursive styles. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Computer Science and Engineering · Vehicle License Plate Recognition
MethodsSigmoid Activation · Tanh Activation · Convolution · CNN Bidirectional LSTM · Long Short-Term Memory · Bidirectional LSTM
