Deep Residual Text Detection Network for Scene Text

Xiangyu Zhu; Yingying Jiang; Shuli Yang; Xiaobing Wang; Wei Li; Pei; Fu; Hua Wang; Zhenbo Luo

arXiv:1711.04147·cs.CV·November 15, 2017·2 cites

Deep Residual Text Detection Network for Scene Text

Xiangyu Zhu, Yingying Jiang, Shuli Yang, Xiaobing Wang, Wei Li, Pei, Fu, Hua Wang, Zhenbo Luo

PDF

Open Access

TL;DR

This paper introduces a deep residual network-based scene text detection method that leverages multi-level features and a vertical proposal mechanism, achieving state-of-the-art accuracy on ICDAR2013.

Contribution

The novel integration of ResNet with multi-level features and a vertical proposal mechanism enhances scene text detection performance.

Findings

01

Achieved F-measure of 0.91 on ICDAR2013

02

Outperforms previous state-of-the-art methods

03

Effective use of hierarchical convolutional features

Abstract

Scene text detection is a challenging problem in computer vision. In this paper, we propose a novel text detection network based on prevalent object detection frameworks. In order to obtain stronger semantic feature, we adopt ResNet as feature extraction layers and exploit multi-level feature by combining hierarchical convolutional networks. A vertical proposal mechanism is utilized to avoid proposal classification, while regression layer remains working to improve localization accuracy. Our approach evaluated on ICDAR2013 dataset achieves F-measure of 0.91, which outperforms previous state-of-the-art results in scene text detection.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition