A method for detecting text of arbitrary shapes in natural scenes that improves text spotting
Qitong Wang, Yi Zheng, Margrit Betke

TL;DR
This paper introduces UHT, a novel text detection method using UNet and Textfill for detecting arbitrarily shaped text in natural scenes, and combines it with recognition system ASTER to improve overall text spotting performance.
Contribution
The main novelty is the UHT detection component that accurately detects curved and multi-oriented text using heatmaps and polygonal boundaries, enhancing scene text understanding.
Findings
UHT outperforms existing methods on multiple datasets.
UHTA achieves at least 9.1% higher F-measure than state-of-the-art.
Effective detection of multilingual, curved, and rotated text in natural scenes.
Abstract
Understanding the meaning of text in images of natural scenes like highway signs or store front emblems is particularly challenging if the text is foreshortened in the image or the letters are artistically distorted. We introduce a pipeline-based text spotting framework that can both detect and recognize text in various fonts, shapes, and orientations in natural scene images with complicated backgrounds. The main contribution of our work is the text detection component, which we call UHT, short for UNet, Heatmap, and Textfill. UHT uses a UNet to compute heatmaps for candidate text regions and a textfill algorithm to produce tight polygonal boundaries around each word in the candidate text. Our method trains the UNet with groundtruth heatmaps that we obtain from text bounding polygons provided by groundtruth annotations. Our text spotting framework, called UHTA, combines UHT with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
MethodsHeatmap
