Improving Text Proposals for Scene Images with Fully Convolutional   Networks

Dena Bazazian; Raul Gomez; Anguelos Nicolaou; Lluis Gomez; Dimosthenis; Karatzas; Andrew D. Bagdanov

arXiv:1702.05089·cs.CV·February 17, 2017·20 cites

Improving Text Proposals for Scene Images with Fully Convolutional Networks

Dena Bazazian, Raul Gomez, Anguelos Nicolaou, Lluis Gomez, Dimosthenis, Karatzas, Andrew D. Bagdanov

PDF

Open Access 1 Repo

TL;DR

This paper enhances text proposal methods for scene images by integrating Fully Convolutional Networks, significantly improving proposal ranking and achieving state-of-the-art results on benchmark datasets.

Contribution

It introduces a novel combination of Text Proposals with Fully Convolutional Networks to improve proposal ranking accuracy in scene text detection.

Findings

01

Superior performance on ICDAR RRC dataset

02

Outperforms current state-of-the-art on COCO-text

03

Improved proposal ranking accuracy

Abstract

Text Proposals have emerged as a class-dependent version of object proposals - efficient approaches to reduce the search space of possible text object locations in an image. Combined with strong word classifiers, text proposals currently yield top state of the art results in end-to-end scene text recognition. In this paper we propose an improvement over the original Text Proposals algorithm of Gomez and Karatzas (2016), combining it with Fully Convolutional Networks to improve the ranking of proposals. Results on the ICDAR RRC and the COCO-text datasets show superior performance over current state-of-the-art.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gombru/TextFCN
caffe2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Retrieval and Classification Techniques · Video Analysis and Summarization