Kernel Proposal Network for Arbitrary Shape Text Detection
Shi-Xue Zhang, Xiaobin Zhu, Jie-Bo Hou, Chun Yang, Xu-Cheng Yin

TL;DR
This paper introduces Kernel Proposal Network (KPN), a novel method for arbitrary shape text detection that effectively separates neighboring text instances using dynamic convolution kernels and orthogonal learning loss, improving robustness and efficiency.
Contribution
The paper presents the first application of dynamic convolution kernels with orthogonal constraints for separating neighboring text instances in scene images.
Findings
Achieves state-of-the-art performance on challenging datasets.
Effectively separates neighboring text instances with improved robustness.
Demonstrates high efficiency and accuracy in arbitrary shape text detection.
Abstract
Segmentation-based methods have achieved great success for arbitrary shape text detection. However, separating neighboring text instances is still one of the most challenging problems due to the complexity of texts in scene images. In this paper, we propose an innovative Kernel Proposal Network (dubbed KPN) for arbitrary shape text detection. The proposed KPN can separate neighboring text instances by classifying different texts into instance-independent feature maps, meanwhile avoiding the complex aggregation process existing in segmentation-based arbitrary shape text detection methods. To be concrete, our KPN will predict a Gaussian center map for each text image, which will be used to extract a series of candidate kernel proposals (i.e., dynamic convolution kernel) from the embedding feature maps according to their corresponding keypoint positions. To enforce the independence between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsConvolution
