# Adversarial Generation of Handwritten Text Images Conditioned on   Sequences

**Authors:** Eloi Alonso, Bastien Moysset, Ronaldo Messina

arXiv: 1903.00277 · 2020-11-12

## TL;DR

This paper introduces a GAN-based system that generates realistic handwritten word images conditioned on text sequences, aiming to augment training data for handwriting recognition systems.

## Contribution

It presents a novel GAN architecture with an auxiliary recognition network to control generated text content, improving synthetic data quality for training handwriting recognition models.

## Key findings

- Generated images are realistic on French and Arabic datasets.
- Synthetic images slightly improve recognition system performance.
- The method effectively controls the textual content of generated handwriting.

## Abstract

State-of-the-art offline handwriting text recognition systems tend to use neural networks and therefore require a large amount of annotated data to be trained. In order to partially satisfy this requirement, we propose a system based on Generative Adversarial Networks (GAN) to produce synthetic images of handwritten words. We use bidirectional LSTM recurrent layers to get an embedding of the word to be rendered, and we feed it to the generator network. We also modify the standard GAN by adding an auxiliary network for text recognition. The system is then trained with a balanced combination of an adversarial loss and a CTC loss. Together, these extensions to GAN enable to control the textual content of the generated word images. We obtain realistic images on both French and Arabic datasets, and we show that integrating these synthetic images into the existing training data of a text recognition system can slightly enhance its performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.00277/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1903.00277/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/1903.00277/full.md

---
Source: https://tomesphere.com/paper/1903.00277