CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection
Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin,, Jinghe Hu, Jingping Shao

TL;DR
This paper introduces CBNet, a plug-and-play network that improves segmentation-based scene text detection by incorporating context-aware and boundary-guided modules, achieving state-of-the-art results efficiently.
Contribution
The paper proposes a novel CBNet architecture with context-aware and boundary-guided modules that enhance segmentation accuracy and boundary precision in scene text detection.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Maintains high speed with high-resolution maps.
Can be integrated into various segmentation methods.
Abstract
Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pixel independently, and the expansion process is difficult to achieve a favorable accuracy-speed trade-off. In this paper, we propose a Context-aware and Boundary-guided Network (CBN) to tackle these problems. In CBN, a basic text detector is firstly used to predict initial segmentation results. Then, we propose a context-aware module to enhance text kernel feature representations, which considers both global and local contexts. Finally, we introduce a boundary-guided module to expand enhanced text kernels adaptively with only the pixels on the contours, which not only obtains accurate text boundaries but also keeps high speed, especially on high-resolution output maps. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Advanced Image and Video Retrieval Techniques
