Bidirectional Regression for Arbitrary-Shaped Text Detection

Tao Sheng; Zhouhui Lian

arXiv:2107.06129·cs.CV·July 14, 2021

Bidirectional Regression for Arbitrary-Shaped Text Detection

Tao Sheng, Zhouhui Lian

PDF

Open Access

TL;DR

This paper introduces a novel bidirectional regression method for arbitrary-shaped text detection that effectively utilizes context information and boundary pixels, leading to improved accuracy on challenging scene text benchmarks.

Contribution

It proposes a new text instance expression integrating foreground and background info, with a post-processing algorithm for precise text instance reconstruction.

Findings

01

Achieves 83.4% F-score on Total-Text

02

Achieves 82.4% F-score on MSRA-TD500

03

Outperforms or matches state-of-the-art methods

Abstract

Arbitrary-shaped text detection has recently attracted increasing interests and witnessed rapid development with the popularity of deep learning algorithms. Nevertheless, existing approaches often obtain inaccurate detection results, mainly due to the relatively weak ability to utilize context information and the inappropriate choice of offset references. This paper presents a novel text instance expression which integrates both foreground and background information into the pipeline, and naturally uses the pixels near text boundaries as the offset starts. Besides, a corresponding post-processing algorithm is also designed to sequentially combine the four prediction results and reconstruct the text instance accurately. We evaluate our method on several challenging scene text benchmarks, including both curved and multi-oriented text datasets. Experimental results demonstrate that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Text and Document Classification Technologies · Advanced Image and Video Retrieval Techniques