Robust Text Detection in Natural Scene Images
Xu-Cheng Yin, Xuwang Yin, Kaizhu Huang, Hong-Wei Hao

TL;DR
This paper presents a robust and accurate method for detecting text in natural scene images, utilizing MSERs, clustering, and machine learning techniques to outperform previous state-of-the-art methods.
Contribution
The paper introduces a novel self-training distance metric learning algorithm and an integrated system that significantly improves text detection accuracy in natural scenes.
Findings
F measure over 76% on ICDAR 2011 dataset, surpassing previous 71%.
Outperforms other methods with over 9% increase in F measure on multilingual datasets.
System is available online for demonstration and further testing.
Abstract
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we propose an accurate and robust method for detecting texts in natural scene images. A fast and effective pruning algorithm is designed to extract Maximally Stable Extremal Regions (MSERs) as character candidates using the strategy of minimizing regularized variations. Character candidates are grouped into text candidates by the ingle-link clustering algorithm, where distance weights and threshold of the clustering algorithm are learned automatically by a novel self-training distance metric learning algorithm. The posterior probabilities of text candidates corresponding to non-text are estimated with an character classifier; text candidates with high probabilities are then eliminated and finally texts are identified with a text classifier. The proposed system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
