Generating Synthetic Data for Text Recognition

Praveen Krishnan; C.V. Jawahar

arXiv:1608.04224·cs.CV·August 16, 2016·35 cites

Generating Synthetic Data for Text Recognition

Praveen Krishnan, C.V. Jawahar

PDF

Open Access 1 Repo

TL;DR

This paper presents a method for generating a large synthetic handwritten dataset using open source fonts and augmentation, aiming to improve handwritten word recognition and spotting.

Contribution

It introduces a new large-scale synthetic handwritten dataset and a framework for generating realistic handwritten images for training deep learning models.

Findings

01

Released 9 million synthetic handwritten word images.

02

Enhanced training data for improved recognition performance.

03

Facilitated advancements in handwritten word spotting.

Abstract

Generating synthetic images is an art which emulates the natural process of image generation in a closest possible manner. In this work, we exploit such a framework for data generation in handwritten domain. We render synthetic data using open source fonts and incorporate data augmentation schemes. As part of this work, we release 9M synthetic handwritten word image corpus which could be useful for training deep network architectures and advancing the performance in handwritten word spotting and recognition tasks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Belval/TextRecognitionDataGenerator
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques