Multi-Oriented Text Detection and Verification in Video Frames and Scene Images
Aneeshan Sain, Ayan Kumar Bhunia, Partha Pratim Roy, Umapada Pal

TL;DR
This paper introduces a novel multi-oriented text detection method in videos and images using Fourier-Laplacian filtering and HMM verification, effectively handling curved and oblique text in multiple scripts.
Contribution
The approach uniquely combines frequency domain filtering, skeletonization, and HMM verification for robust multi-oriented text detection, including curved text, in diverse scripts.
Findings
Outperforms existing methods in multi-oriented text detection
Effective in detecting curved and oblique text in videos and images
Successfully applied to English, Chinese, Devanagari, and Bengali scripts
Abstract
In this paper, we bring forth a novel approach of video text detection using Fourier-Laplacian filtering in the frequency domain that includes a verification technique using Hidden Markov Model (HMM). The proposed approach deals with the text region appearing not only in horizontal or vertical directions, but also in any other oblique or curved orientation in the image. Until now only a few methods have been proposed that look into curved text detection in video frames, wherein lies our novelty. In our approach, we first apply Fourier-Laplacian transform on the image followed by an ideal Laplacian-Gaussian filtering. Thereafter K-means clustering is employed to obtain the asserted text areas depending on a maximum difference map. Next, the obtained connected components (CC) are skeletonized to distinguish various text strings. Complex components are disintegrated into simpler ones…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsk-Means Clustering
