Gradient Difference based approach for Text Localization in Compressed   domain

B.H. Shekar; Smitha M.L

arXiv:1502.03918·cs.CV·February 23, 2015·1 cites

Gradient Difference based approach for Text Localization in Compressed domain

B.H. Shekar, Smitha M.L

PDF

Open Access

TL;DR

This paper introduces a novel gradient difference method for text localization in compressed video frames and images, utilizing wavelet transforms, zero crossing, and morphological operations to accurately detect texts of various styles.

Contribution

The paper presents a new approach combining gradient difference and zero crossing techniques for scene text localization directly in compressed domain images.

Findings

01

Effective detection of texts of various sizes and fonts

02

High accuracy demonstrated on standard datasets

03

Method reduces false positives through combined analysis

Abstract

In this paper, we propose a gradient difference based approach to text localization in videos and scene images. The input video frame/ image is first compressed using multilevel 2-D wavelet transform. The edge information of the reconstructed image is found which is further used for finding the maximum gradient difference between the pixels and then the boundaries of the detected text blocks are computed using zero crossing technique. We perform logical AND operation of the text blocks obtained by gradient difference and the zero crossing technique followed by connected component analysis to eliminate the false positives. Finally, the morphological dilation operation is employed on the detected text blocks for scene text localization. The experimental results obtained on publicly available standard datasets illustrate that the proposed method can detect and localize the texts of various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Natural Language Processing Techniques