Arbitrary Shape Text Detection via Segmentation with Probability Maps
Shi-Xue Zhang, Xiaobin Zhu, Lei Chen, Jie-Bo Hou, Xu-Cheng Yin

TL;DR
This paper introduces a robust segmentation-based method using multiple probability maps and iterative learning to accurately detect arbitrarily shaped text in images, overcoming annotation limitations.
Contribution
It proposes a novel probability map approach with Sigmoid Alpha Functions and an iterative model to improve text detection accuracy with coarse annotations.
Findings
Achieves state-of-the-art detection accuracy on benchmarks.
Effectively handles complex text shapes and orientations.
Improves robustness against annotation inaccuracies.
Abstract
Arbitrary shape text detection is a challenging task due to the significantly varied sizes and aspect ratios, arbitrary orientations or shapes, inaccurate annotations, etc. Due to the scalability of pixel-level prediction, segmentation-based methods can adapt to various shape texts and hence attracted considerable attention recently. However, accurate pixel-level annotations of texts are formidable, and the existing datasets for scene text detection only provide coarse-grained boundary annotations. Consequently, numerous misclassified text pixels or background pixels inside annotations always exist, degrading the performance of segmentation-based text detection methods. Generally speaking, whether a pixel belongs to text or not is highly related to the distance with the adjacent annotation boundary. With this observation, in this paper, we propose an innovative and robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction
