CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text   Detection

Xi Zhao; Wei Feng; Zheng Zhang; Jingjing Lv; Xin Zhu; Zhangang Lin,; Jinghe Hu; Jingping Shao

arXiv:2212.02340·cs.CV·March 25, 2024

CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection

Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin,, Jinghe Hu, Jingping Shao

PDF

Open Access 1 Repo

TL;DR

This paper introduces CBNet, a plug-and-play network that improves segmentation-based scene text detection by incorporating context-aware and boundary-guided modules, achieving state-of-the-art results efficiently.

Contribution

The paper proposes a novel CBNet architecture with context-aware and boundary-guided modules that enhance segmentation accuracy and boundary precision in scene text detection.

Findings

01

Achieves state-of-the-art results on multiple benchmarks.

02

Maintains high speed with high-resolution maps.

03

Can be integrated into various segmentation methods.

Abstract

Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion. However, the segmentation process only considers each pixel independently, and the expansion process is difficult to achieve a favorable accuracy-speed trade-off. In this paper, we propose a Context-aware and Boundary-guided Network (CBN) to tackle these problems. In CBN, a basic text detector is firstly used to predict initial segmentation results. Then, we propose a context-aware module to enhance text kernel feature representations, which considers both global and local contexts. Finally, we introduce a boundary-guided module to expand enhanced text kernels adaptively with only the pixels on the contours, which not only obtains accurate text boundaries but also keeps high speed, especially on high-resolution output maps. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiizhao/cbn.pytorch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Advanced Image and Video Retrieval Techniques