Data Augmentation for Scene Text Recognition

Rowel Atienza

arXiv:2108.06949·cs.CV·August 17, 2021

TL;DR

This paper introduces STRAug, a set of 36 image augmentation functions specifically designed for scene text recognition, significantly improving model accuracy by better simulating real-world text image variations.

Contribution

The paper presents STRAug, a novel augmentation toolkit tailored for STR that enhances training data diversity and model robustness against real-world distortions.

Findings

01 Increases accuracy of STR models on multiple datasets

02 Improves robustness to noise, artifacts, and geometric distortions

03 Easy to implement and replicate with open-source code

Abstract

Scene text recognition (STR) is a challenging task in computer vision due to the large number of possible text appearances in natural scenes. Most STR models rely on synthetic datasets for training since there are no sufficiently large, publicly available labelled real datasets. Since STR models are evaluated using real data, the mismatch between training and testing data distributions results in poor model performance, especially on challenging text affected by noise, artifacts, geometry, structure, etc. In this paper, we introduce STRAug, which is made up of 36 image augmentation functions designed for STR. Each function mimics certain text image properties that can be found in natural scenes, caused by camera sensors, or induced by signal processing operations, but that are poorly represented in the training dataset. When applied to strong baseline models using RandAugment, STRAug…
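
The abstract mentions applying the augmentation functions through RandAugment. As a rough illustration of that idea only, the sketch below draws a fixed number of operations at random from a pool and applies each at a shared magnitude. It is not the STRAug code or its API: the three operations (`add_noise`, `adjust_contrast`, `invert`), the 0 to 2 magnitude scale, and the representation of a grayscale image as a list of pixel rows are all assumptions made to keep the example self-contained.

```python
import random

def _clamp(v):
    # Keep a pixel value inside the valid 8-bit range.
    return max(0, min(255, int(round(v))))

# Hypothetical stand-in operations; these are NOT the 36 STRAug functions.
def add_noise(img, mag, rng):
    # Additive Gaussian noise whose spread grows with the magnitude.
    sigma = 8.0 * (mag + 1)
    return [[_clamp(p + rng.gauss(0.0, sigma)) for p in row] for row in img]

def adjust_contrast(img, mag, rng):
    # Scale pixel values around mid-gray, randomly raising or lowering contrast.
    factor = 1.0 + rng.choice([-1, 1]) * 0.2 * (mag + 1)
    return [[_clamp(128 + factor * (p - 128)) for p in row] for row in img]

def invert(img, mag, rng):
    # Swap dark and light; the magnitude is ignored for this operation.
    return [[255 - p for p in row] for row in img]

OPS = [add_noise, adjust_contrast, invert]

def rand_augment(img, ops=OPS, num_ops=2, magnitude=1, rng=None):
    """Apply `num_ops` randomly chosen operations at one shared magnitude."""
    rng = rng or random.Random()
    # Operations are drawn uniformly, with replacement.
    for op in rng.choices(ops, k=num_ops):
        img = op(img, magnitude, rng)
    return img

# Usage: a blank white 100x32 "text image" as a list of pixel rows.
sample = [[255] * 100 for _ in range(32)]
augmented = rand_augment(sample, num_ops=2, magnitude=1, rng=random.Random(0))
print(len(augmented), len(augmented[0]))
```

The appeal of this style of policy is that it has only two knobs, the number of operations and the magnitude, so the same training loop can draw a fresh random combination for every image without a learned augmentation schedule.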

Code & Models

Repositories

roatienza/straug
Official

Taxonomy

Methods: RandAugment