Have convolutions already made recurrence obsolete for unconstrained   handwritten text recognition ?

Denis Coquenet; Yann Soullard; Cl\'ement Chatelain; Thierry Paquet

arXiv:2012.04954·cs.CV·December 10, 2020

Have convolutions already made recurrence obsolete for unconstrained handwritten text recognition ?

Denis Coquenet, Yann Soullard, Cl\'ement Chatelain, Thierry Paquet

PDF

TL;DR

This paper investigates whether convolutional neural networks can replace recurrent networks in unconstrained handwritten text recognition, comparing their performance and training efficiency on standard and augmented datasets.

Contribution

It provides an experimental comparison between convolutional architectures and traditional CNN+BLSTM models for handwritten text recognition, including analysis on augmented data.

Findings

01

Convolutional architectures achieve competitive accuracy with recurrent models.

02

Convolutional models train faster and support better parallelism.

03

Augmented data with printed grids impacts recognition performance.

Abstract

Unconstrained handwritten text recognition remains an important challenge for deep neural networks. These last years, recurrent networks and more specifically Long Short-Term Memory networks have achieved state-of-the-art performance in this field. Nevertheless, they are made of a large number of trainable parameters and training recurrent neural networks does not support parallelism. This has a direct influence on the training time of such architectures, with also a direct consequence on the time required to explore various architectures. Recently, recurrence-free architectures such as Fully Convolutional Networks with gated mechanisms have been proposed as one possible alternative achieving competitive results. In this paper, we explore convolutional architectures and compare them to a CNN+BLSTM baseline. We propose an experimental study regarding different architectures on an offline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.