UTRNet: High-Resolution Urdu Text Recognition In Printed Documents
Abdur Rahman, Arjun Ghosh, and Chetan Arora

TL;DR
This paper introduces UTRNet, a high-resolution hybrid CNN-RNN model for Urdu printed text recognition, supported by new large-scale datasets and an online OCR tool, achieving state-of-the-art results.
Contribution
The paper presents a novel high-resolution hybrid CNN-RNN architecture for Urdu OCR, along with large-scale real and synthetic datasets, and an integrated online OCR tool, advancing Urdu text recognition technology.
Findings
UTRNet achieves state-of-the-art accuracy on benchmark datasets.
Introduction of UTRSet-Real and UTRSet-Synth datasets enhances training and evaluation.
Development of an online Urdu OCR tool demonstrates practical applicability.
Abstract
In this paper, we propose a novel approach to address the challenges of printed Urdu text recognition using high-resolution, multi-scale semantic feature extraction. Our proposed UTRNet architecture, a hybrid CNN-RNN model, demonstrates state-of-the-art performance on benchmark datasets. To address the limitations of previous works, which struggle to generalize to the intricacies of the Urdu script and the lack of sufficient annotated real-world data, we have introduced the UTRSet-Real, a large-scale annotated real-world dataset comprising over 11,000 lines and UTRSet-Synth, a synthetic dataset with 20,000 lines closely resembling real-world and made corrections to the ground truth of the existing IIITH dataset, making it a more reliable resource for future research. We also provide UrduDoc, a benchmark dataset for Urdu text line detection in scanned documents. Additionally, we have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition
