Total-Text: A Comprehensive Dataset for Scene Text Detection and   Recognition

Chee Kheng Chng; Chee Seng Chan

arXiv:1710.10400·cs.CV·October 31, 2017

Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition

Chee Kheng Chng, Chee Seng Chan

PDF

1 Repo 1 Datasets

TL;DR

Total-Text is a new dataset designed to include curved, multi-oriented, and horizontal scene texts, filling a significant gap in existing datasets, and it enables evaluation of segmentation-based text detection methods on curved text.

Contribution

The paper introduces Total-Text, a comprehensive dataset with diverse text orientations including curved text, and benchmarks segmentation-based detection methods on it.

Findings

01

DeconvNet performs robustly on curved text in Total-Text.

02

Total-Text dataset contains diverse orientations, with over half of images having multiple orientations.

03

Segmentation-based methods are effective for curved text detection.

Abstract

Text in curve orientation, despite being one of the common text orientations in real world environment, has close to zero existence in well received scene text datasets such as ICDAR2013 and MSRA-TD500. The main motivation of Total-Text is to fill this gap and facilitate a new research direction for the scene text community. On top of the conventional horizontal and multi-oriented texts, it features curved-oriented text. Total-Text is highly diversified in orientations, more than half of its images have a combination of more than two orientations. Recently, a new breed of solutions that casted text detection as a segmentation problem has demonstrated their effectiveness against multi-oriented text. In order to evaluate its robustness against curved text, we fine-tuned DeconvNet and benchmark it on Total-Text. Total-Text with its annotation is available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cs-chan/Total-Text-Dataset
noneOfficial

Datasets

Voxel51/Total-Text-Dataset
dataset· 2.4k dl
2.4k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.