Arbitrary Shape Text Detection via Segmentation with Probability Maps

Shi-Xue Zhang; Xiaobin Zhu; Lei Chen; Jie-Bo Hou; Xu-Cheng Yin

arXiv:2208.12419·cs.CV·August 29, 2022

Arbitrary Shape Text Detection via Segmentation with Probability Maps

Shi-Xue Zhang, Xiaobin Zhu, Lei Chen, Jie-Bo Hou, Xu-Cheng Yin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a robust segmentation-based method using multiple probability maps and iterative learning to accurately detect arbitrarily shaped text in images, overcoming annotation limitations.

Contribution

It proposes a novel probability map approach with Sigmoid Alpha Functions and an iterative model to improve text detection accuracy with coarse annotations.

Findings

01

Achieves state-of-the-art detection accuracy on benchmarks.

02

Effectively handles complex text shapes and orientations.

03

Improves robustness against annotation inaccuracies.

Abstract

Arbitrary shape text detection is a challenging task due to the significantly varied sizes and aspect ratios, arbitrary orientations or shapes, inaccurate annotations, etc. Due to the scalability of pixel-level prediction, segmentation-based methods can adapt to various shape texts and hence attracted considerable attention recently. However, accurate pixel-level annotations of texts are formidable, and the existing datasets for scene text detection only provide coarse-grained boundary annotations. Consequently, numerous misclassified text pixels or background pixels inside annotations always exist, degrading the performance of segmentation-based text detection methods. Generally speaking, whether a pixel belongs to text or not is highly related to the distance with the adjacent annotation boundary. With this observation, in this paper, we propose an innovative and robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gxym/textpms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Image Processing and 3D Reconstruction