FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection
Xugong Qin, Yu Zhou, Dayan Wu, Yinliang Yue, Weiping Wang

TL;DR
This paper introduces FC2RN, a fully convolutional network that refines corner predictions for multi-oriented scene text detection, achieving superior accuracy on multiple datasets by improving initial geometry predictions.
Contribution
The paper proposes a novel corner refinement network with a quadrilateral RoI convolution for improved multi-oriented text detection, maintaining a simple pipeline.
Findings
Outperforms state-of-the-art methods on four public datasets.
Effective corner refinement improves detection accuracy.
Ablation study confirms the benefits of corner refinement and scoring.
Abstract
Recent scene text detection works mainly focus on curve text detection. However, in real applications, the curve texts are more scarce than the multi-oriented ones. Accurate detection of multi-oriented text with large variations of scales, orientations, and aspect ratios is of great significance. Among the multi-oriented detection methods, direct regression for the geometry of scene text shares a simple yet powerful pipeline and gets popular in academic and industrial communities, but it may produce imperfect detections, especially for long texts due to the limitation of the receptive field. In this work, we aim to improve this while keeping the pipeline simple. A fully convolutional corner refinement network (FC2RN) is proposed for accurate multi-oriented text detection, in which an initial corner prediction and a refined corner prediction are obtained at one pass. With a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Vehicle License Plate Recognition · Image Processing and 3D Reconstruction
MethodsConvolution
