Towards Detecting, Recognizing, and Parsing the Address Information from Bangla Signboard: A Deep Learning-based Approach
Hasan Murad, Mohammed Eunus Ali

TL;DR
This paper presents a comprehensive deep learning-based system for detecting, recognizing, correcting, and parsing Bangla address information from signboards, addressing the challenges of natural scene text extraction in low-resource languages.
Contribution
It introduces an end-to-end framework with novel datasets, comparative analysis of recognition models, and a transformer-based address text parser for Bangla signboards.
Findings
Created annotated and synthetic datasets for Bangla signboard text
Compared various CTC-based and Encoder-Decoder architectures for recognition
Developed a transformer-based address text correction and parsing system
Abstract
Retrieving textual information from natural scene images is an active research area in the field of computer vision with numerous practical applications. Detecting text regions and extracting text from signboards is a challenging problem due to special characteristics like reflecting lights, uneven illumination, or shadows found in real-life natural scene images. With the advent of deep learning-based methods, different sophisticated techniques have been proposed for text detection and text recognition from the natural scene. Though a significant amount of effort has been devoted to extracting natural scene text for resourceful languages like English, little has been done for low-resource languages like Bangla. In this research work, we have proposed an end-to-end system with deep learning-based models for efficiently detecting, recognizing, correcting, and parsing address information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Hand Gesture Recognition Systems
