Adaptive Segmentation Network for Scene Text Detection
Guiqin Zhao

TL;DR
This paper introduces ASNet, an adaptive scene text detection network that automatically learns segmentation thresholds and employs a novel feature pyramid for improved detection of texts with extreme aspect ratios, achieving state-of-the-art results.
Contribution
The paper presents a novel adaptive segmentation network with automatic threshold learning and a global-information enhanced feature pyramid for superior scene text detection.
Findings
Achieves state-of-the-art performance on four benchmarks.
Effectively detects texts with extreme aspect ratios.
Validates contributions through extensive ablation studies.
Abstract
Inspired by deep convolution segmentation algorithms, scene text detectors break the performance ceiling of datasets steadily. However, these methods often encounter threshold selection bottlenecks and have poor performance on text instances with extreme aspect ratios. In this paper, we propose to automatically learn the discriminate segmentation threshold, which distinguishes text pixels from background pixels for segmentation-based scene text detectors and then further reduces the time-consuming manual parameter adjustment. Besides, we design a Global-information Enhanced Feature Pyramid Network (GE-FPN) for capturing text instances with macro size and extreme aspect ratios. Following the GE-FPN, we introduce a cascade optimization structure to further refine the text instances. Finally, together with the proposed threshold learning strategy and text detection structure, we design an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Text and Document Classification Technologies
MethodsConvolution
