# A Scalable Handwritten Text Recognition System

**Authors:** R. Reeve Ingle, Yasuhisa Fujii, Thomas Deselaers, Jonathan Baccash,, Ashok C. Popat

arXiv: 1904.09150 · 2019-06-18

## TL;DR

This paper presents a scalable handwritten text recognition system that leverages online handwriting data for training, introduces a non-recurrent neural network model for efficiency, and offers an integration approach for large-scale multilingual OCR systems.

## Contribution

It introduces a novel data generation pipeline using online handwriting data, a non-recurrent neural network model for line recognition, and an integration method for large-scale OCR systems.

## Key findings

- Online data significantly improves HTR models with limited real images.
- The proposed neural network achieves accuracy comparable to LSTM models.
- The system enables efficient, scalable multilingual handwritten text recognition.

## Abstract

Many studies on (Offline) Handwritten Text Recognition (HTR) systems have focused on building state-of-the-art models for line recognition on small corpora. However, adding HTR capability to a large scale multilingual OCR system poses new challenges. This paper addresses three problems in building such systems: data, efficiency, and integration. Firstly, one of the biggest challenges is obtaining sufficient amounts of high quality training data. We address the problem by using online handwriting data collected for a large scale production online handwriting recognition system. We describe our image data generation pipeline and study how online data can be used to build HTR models. We show that the data improve the models significantly under the condition where only a small number of real images is available, which is usually the case for HTR models. It enables us to support a new script at substantially lower cost. Secondly, we propose a line recognition model based on neural networks without recurrent connections. The model achieves a comparable accuracy with LSTM-based models while allowing for better parallelism in training and inference. Finally, we present a simple way to integrate HTR models into an OCR system. These constitute a solution to bring HTR capability into a large scale OCR system.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.09150/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1904.09150/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1904.09150/full.md

---
Source: https://tomesphere.com/paper/1904.09150