SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text   Recognition Models

Moonbin Yim; Yoonsik Kim; Han-Cheol Cho; Sungrae Park

arXiv:2107.09313·cs.CV·July 21, 2021

SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text Recognition Models

Moonbin Yim, Yoonsik Kim, Han-Cheol Cho, Sungrae Park

PDF

1 Repo

TL;DR

SynthTIGER is a new synthetic text image generator that improves scene text recognition models by producing more diverse and balanced training data, outperforming existing synthetic datasets.

Contribution

It introduces SynthTIGER, a comprehensive synthetic text image generator that integrates multiple synthesis techniques and addresses data imbalance issues.

Findings

01

SynthTIGER outperforms combined existing synthetic datasets in STR tasks.

02

Ablation studies confirm the effectiveness of SynthTIGER's components.

03

Guidelines for synthetic image generation improve STR model training.

Abstract

For successful scene text recognition (STR) models, synthetic text image generators have alleviated the lack of annotated text images from the real world. Specifically, they generate multiple text images with diverse backgrounds, font styles, and text shapes and enable STR models to learn visual patterns that might not be accessible from manually annotated data. In this paper, we introduce a new synthetic text image generator, SynthTIGER, by analyzing techniques used for text image synthesis and integrating effective ones under a single algorithm. Moreover, we propose two techniques that alleviate the long-tail problem in length and character distributions of training data. In our experiments, SynthTIGER achieves better STR performance than the combination of synthetic datasets, MJSynth (MJ) and SynthText (ST). Our ablation study demonstrates the benefits of using sub-components of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clovaai/synthtiger
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.