Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding
Partha Pratim Roy, Ayan Kumar Bhunia, Avirup Bhattacharyya, Umapada, Pal

TL;DR
This paper introduces a novel multi-script word spotting framework using dynamic shape coding and script-wise keyword generation, significantly improving text retrieval accuracy in complex scene images and video frames across multiple scripts.
Contribution
The paper presents a new multi-script word spotting method with dynamic shape coding and on-the-fly script-specific keyword generation, addressing limitations of existing script-specific approaches.
Findings
Effective retrieval of keywords in scene images and videos across multiple scripts.
Improved text alignment and confusion reduction through dynamic shape coding.
Successful application to English, Bangla, and Devanagari scripts with promising results.
Abstract
Retrieval of text information from natural scene images and video frames is a challenging task due to its inherent problems like complex character shapes, low resolution, background noise, etc. Available OCR systems often fail to retrieve such information in scene/video frames. Keyword spotting, an alternative way to retrieve information, performs efficient text searching in such scenarios. However, current word spotting techniques in scene/video images are script-specific and they are mainly developed for Latin script. This paper presents a novel word spotting framework using dynamic shape coding for text retrieval in natural scene image and video frames. The framework is designed to search query keyword from multiple scripts with the help of on-the-fly script-wise keyword generation for the corresponding script. We have used a two-stage word spotting approach using Hidden Markov Model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
