AViTExt: Automatic Video Text Extraction, A new Approach for video content indexing Application
Baseem Bouaziz, Tarek Zlitni, Walid Mahdi

TL;DR
This paper introduces AViTExt, a novel method for automatic video text detection that uses spatial-temporal analysis to identify and filter text regions, improving indexing accuracy.
Contribution
The paper presents a new spatial-temporal technique for video text detection that dynamically analyzes frame differences and filters regions based on temporal redundancy.
Findings
Achieved 89.39% precision in text detection
Achieved 90.19% recall in text detection
Effective for diverse video sequences
Abstract
In this paper, we propose a spatial temporal video-text detection technique which proceed in two principal steps:potential text region detection and a filtering process. In the first step we divide dynamically each pair of consecutive video frames into sub block in order to detect change. A significant difference between homologous blocks implies the appearance of an important object which may be a text region. The temporal redundancy is then used to filter these regions and forms an effective text region. The experimentation driven on a variety of video sequences shows the effectiveness of our approach by obtaining a 89,39% as precision rate and 90,19 as recall.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
