Exploration of Deep Learning Based Recognition for Urdu Text

Sumaiya Fazal; Sheeraz Ahmed

arXiv:2508.13245·cs.CV·August 20, 2025

Exploration of Deep Learning Based Recognition for Urdu Text

Sumaiya Fazal, Sheeraz Ahmed

PDF

TL;DR

This paper presents a convolutional neural network-based system for recognizing Urdu text, addressing segmentation challenges and achieving high accuracy in component classification.

Contribution

The study introduces a CNN-based Urdu OCR system utilizing hierarchical classification and permutation-based dataset generation, improving recognition accuracy.

Findings

01

Achieved 99% accuracy in component classification.

02

Developed a dataset through permutation of characters.

03

Implemented a hierarchical neural network for recognition.

Abstract

Urdu is a cursive script language and has similarities with Arabic and many other South Asian languages. Urdu is difficult to classify due to its complex geometrical and morphological structure. Character classification can be processed further if segmentation technique is efficient, but due to context sensitivity in Urdu, segmentation-based recognition often results with high error rate. Our proposed approach for Urdu optical character recognition system is a component-based classification relying on automatic feature learning technique called convolutional neural network. CNN is trained and tested on Urdu text dataset, which is generated through permutation process of three characters and further proceeds to discarding unnecessary images by applying connected component technique in order to obtain ligature only. Hierarchical neural network is implemented with two levels to deal with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.