Boosting Handwriting Text Recognition in Small Databases with Transfer   Learning

Jos\'e Carlos Aradillas; Juan Jos\'e Murillo-Fuentes; Pablo M. Olmos

arXiv:1804.01527·cs.CV·December 8, 2020

Boosting Handwriting Text Recognition in Small Databases with Transfer Learning

Jos\'e Carlos Aradillas, Juan Jos\'e Murillo-Fuentes, Pablo M. Olmos

PDF

3 Repos

TL;DR

This paper demonstrates that transfer learning from larger handwriting datasets significantly improves recognition accuracy and reduces training time for small datasets in offline handwriting text recognition tasks.

Contribution

It introduces a transfer learning approach for CNN-LSTM-CTC models that enhances performance on small handwriting datasets by reusing parameters from larger datasets.

Findings

01

Transfer learning reduces CER from 18.2% to 3.3% with 350 training lines.

02

Training time is significantly decreased using transfer learning.

03

The approach generalizes well to different small datasets like Parzival.

Abstract

In this paper we deal with the offline handwriting text recognition (HTR) problem with reduced training datasets. Recent HTR solutions based on artificial neural networks exhibit remarkable solutions in referenced databases. These deep learning neural networks are composed of both convolutional (CNN) and long short-term memory recurrent units (LSTM). In addition, connectionist temporal classification (CTC) is the key to avoid segmentation at character level, greatly facilitating the labeling task. One of the main drawbacks of the CNNLSTM-CTC (CLC) solutions is that they need a considerable part of the text to be transcribed for every type of calligraphy, typically in the order of a few thousands of lines. Furthermore, in some scenarios the text to transcribe is not that long, e.g. in the Washington database. The CLC typically overfits for this reduced number of training samples. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.