Word Searching in Scene Image and Video Frame in Multi-Script Scenario   using Dynamic Shape Coding

Partha Pratim Roy; Ayan Kumar Bhunia; Avirup Bhattacharyya; Umapada; Pal

arXiv:1708.05529·cs.CV·July 31, 2018

Word Searching in Scene Image and Video Frame in Multi-Script Scenario using Dynamic Shape Coding

Partha Pratim Roy, Ayan Kumar Bhunia, Avirup Bhattacharyya, Umapada, Pal

PDF

TL;DR

This paper introduces a novel multi-script word spotting framework using dynamic shape coding and script-wise keyword generation, significantly improving text retrieval accuracy in complex scene images and video frames across multiple scripts.

Contribution

The paper presents a new multi-script word spotting method with dynamic shape coding and on-the-fly script-specific keyword generation, addressing limitations of existing script-specific approaches.

Findings

01

Effective retrieval of keywords in scene images and videos across multiple scripts.

02

Improved text alignment and confusion reduction through dynamic shape coding.

03

Successful application to English, Bangla, and Devanagari scripts with promising results.

Abstract

Retrieval of text information from natural scene images and video frames is a challenging task due to its inherent problems like complex character shapes, low resolution, background noise, etc. Available OCR systems often fail to retrieve such information in scene/video frames. Keyword spotting, an alternative way to retrieve information, performs efficient text searching in such scenarios. However, current word spotting techniques in scene/video images are script-specific and they are mainly developed for Latin script. This paper presents a novel word spotting framework using dynamic shape coding for text retrieval in natural scene image and video frames. The framework is designed to search query keyword from multiple scripts with the help of on-the-fly script-wise keyword generation for the corresponding script. We have used a two-stage word spotting approach using Hidden Markov Model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.