AViTExt: Automatic Video Text Extraction, A new Approach for video   content indexing Application

Baseem Bouaziz; Tarek Zlitni; Walid Mahdi

arXiv:1301.2173·cs.MM·January 11, 2013

AViTExt: Automatic Video Text Extraction, A new Approach for video content indexing Application

Baseem Bouaziz, Tarek Zlitni, Walid Mahdi

PDF

Open Access

TL;DR

This paper introduces AViTExt, a novel method for automatic video text detection that uses spatial-temporal analysis to identify and filter text regions, improving indexing accuracy.

Contribution

The paper presents a new spatial-temporal technique for video text detection that dynamically analyzes frame differences and filters regions based on temporal redundancy.

Findings

01

Achieved 89.39% precision in text detection

02

Achieved 90.19% recall in text detection

03

Effective for diverse video sequences

Abstract

In this paper, we propose a spatial temporal video-text detection technique which proceed in two principal steps:potential text region detection and a filtering process. In the first step we divide dynamically each pair of consecutive video frames into sub block in order to detect change. A significant difference between homologous blocks implies the appearance of an important object which may be a text region. The temporal redundancy is then used to filter these regions and forms an effective text region. The experimentation driven on a variety of video sequences shows the effectiveness of our approach by obtaining a 89,39% as precision rate and 90,19 as recall.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques