Towards Robust Curve Text Detection with Conditional Spatial Expansion
Zichuan Liu, Guosheng Lin, Sheng Yang, Fayao Liu, Weisi Lin, Wang, Ling Goh

TL;DR
This paper introduces a novel Conditional Spatial Expansion mechanism for robust curve text detection, treating it as a region expansion process rather than traditional polygon regression, leading to improved accuracy and robustness.
Contribution
The paper proposes a new CSE method that enhances curve text detection by modeling it as a region expansion, which is more robust and simpler than existing approaches.
Findings
Achieves a state-of-the-art F-score of 78.4% on benchmarks.
Effectively handles texts with various shapes, sizes, and orientations.
Reduces false positives from text-like textures.
Abstract
It is challenging to detect curve texts due to their irregular shapes and varying sizes. In this paper, we first investigate the deficiency of the existing curve detection methods and then propose a novel Conditional Spatial Expansion (CSE) mechanism to improve the performance of curve text detection. Instead of regarding the curve text detection as a polygon regression or a segmentation problem, we treat it as a region expansion process. Our CSE starts with a seed arbitrarily initialized within a text region and progressively merges neighborhood regions based on the extracted local features by a CNN and contextual information of merged regions. The CSE is highly parameterized and can be seamlessly integrated into existing object detection frameworks. Enhanced by the data-dependent CSE mechanism, our curve text detection system provides robust instance-level text region extraction with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
