FC2RN: A Fully Convolutional Corner Refinement Network for Accurate   Multi-Oriented Scene Text Detection

Xugong Qin; Yu Zhou; Dayan Wu; Yinliang Yue; Weiping Wang

arXiv:2007.05113·cs.CV·July 13, 2020·6 cites

FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection

Xugong Qin, Yu Zhou, Dayan Wu, Yinliang Yue, Weiping Wang

PDF

Open Access

TL;DR

This paper introduces FC2RN, a fully convolutional network that refines corner predictions for multi-oriented scene text detection, achieving superior accuracy on multiple datasets by improving initial geometry predictions.

Contribution

The paper proposes a novel corner refinement network with a quadrilateral RoI convolution for improved multi-oriented text detection, maintaining a simple pipeline.

Findings

01

Outperforms state-of-the-art methods on four public datasets.

02

Effective corner refinement improves detection accuracy.

03

Ablation study confirms the benefits of corner refinement and scoring.

Abstract

Recent scene text detection works mainly focus on curve text detection. However, in real applications, the curve texts are more scarce than the multi-oriented ones. Accurate detection of multi-oriented text with large variations of scales, orientations, and aspect ratios is of great significance. Among the multi-oriented detection methods, direct regression for the geometry of scene text shares a simple yet powerful pipeline and gets popular in academic and industrial communities, but it may produce imperfect detections, especially for long texts due to the limitation of the receptive field. In this work, we aim to improve this while keeping the pipeline simple. A fully convolutional corner refinement network (FC2RN) is proposed for accurate multi-oriented text detection, in which an initial corner prediction and a refined corner prediction are obtained at one pass. With a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction

MethodsConvolution