Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids
Saad Bin Ahmed, Saeeda Naz, Muhammad Imran Razzak, and Rubiyah Yusof

TL;DR
This paper introduces a novel deep convolutional linear pyramid method for analyzing cursive scene text, specifically Arabic, achieving high accuracy in recognition tasks.
Contribution
It proposes a new approach using linear pyramids and empirically selected kernels for improved cursive scene text analysis, especially for Arabic script.
Findings
Achieved 0.17% error rate on Arabic scene text recognition
Utilized linear pyramids as features for scene text analysis
Demonstrated effectiveness on EASTR-42k dataset
Abstract
The camera captured images have various aspects to investigate. Generally, the emphasis of research depends on the interesting regions. Sometimes the focus could be on color segmentation, object detection or scene text analysis. The image analysis, visibility and layout analysis are the tasks easier for humans as suggested by behavioral trait of humans, but in contrast when these same tasks are supposed to perform by machines then it seems to be challenging. The learning machines always learn from the properties associated to provided samples. The numerous approaches are designed in recent years for scene text extraction and recognition and the efforts are underway to improve the accuracy. The convolutional approach provided reasonable results on non-cursive text analysis appeared in natural images. The work presented in this manuscript exploited the strength of linear pyramids by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Vehicle License Plate Recognition
