# No Padding Please: Efficient Neural Handwriting Recognition

**Authors:** Gideon Maillette de Buy Wenniger, Lambert Schomaker, Andy Way

arXiv: 1902.11208 · 2020-04-03

## TL;DR

This paper introduces methods to improve the efficiency of neural handwriting recognition models, especially by eliminating padding waste through example-packing, leading to significant speedups while maintaining state-of-the-art accuracy.

## Contribution

The paper presents a novel example-packing technique for MDLSTM-based models, optimizing parallelization and reducing computation waste in handwriting recognition tasks.

## Key findings

- Speed improvement of 6.6x for word-based NHR
- Comparable accuracy to state-of-the-art models on IAM dataset
- Effective techniques for parallelization across GPUs

## Abstract

Neural handwriting recognition (NHR) is the recognition of handwritten text with deep learning models, such as multi-dimensional long short-term memory (MDLSTM) recurrent neural networks. Models with MDLSTM layers have achieved state-of-the art results on handwritten text recognition tasks. While multi-directional MDLSTM-layers have an unbeaten ability to capture the complete context in all directions, this strength limits the possibilities for parallelization, and therefore comes at a high computational cost. In this work we develop methods to create efficient MDLSTM-based models for NHR, particularly a method aimed at eliminating computation waste that results from padding. This proposed method, called example-packing, replaces wasteful stacking of padded examples with efficient tiling in a 2-dimensional grid. For word-based NHR this yields a speed improvement of factor 6.6 over an already efficient baseline of minimal padding for each batch separately. For line-based NHR the savings are more modest, but still significant. In addition to example-packing, we propose: 1) a technique to optimize parallelization for dynamic graph definition frameworks including PyTorch, using convolutions with grouping, 2) a method for parallelization across GPUs for variable-length example batches. All our techniques are thoroughly tested on our own PyTorch re-implementation of MDLSTM-based NHR models. A thorough evaluation on the IAM dataset shows that our models are performing similar to earlier implementations of state-of-the-art models. Our efficient NHR model and some of the reusable techniques discussed with it offer ways to realize relatively efficient models for the omnipresent scenario of variable-length inputs in deep learning.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.11208/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/1902.11208/full.md

## References

20 references — full list in the complete paper: https://tomesphere.com/paper/1902.11208/full.md

---
Source: https://tomesphere.com/paper/1902.11208