TL;DR
This paper demonstrates that transfer learning from larger handwriting datasets significantly improves recognition accuracy and reduces training time for small datasets in offline handwriting text recognition tasks.
Contribution
It introduces a transfer learning approach for CNN-LSTM-CTC models that enhances performance on small handwriting datasets by reusing parameters from larger datasets.
Findings
Transfer learning reduces CER from 18.2% to 3.3% with 350 training lines.
Training time is significantly decreased using transfer learning.
The approach generalizes well to different small datasets like Parzival.
Abstract
In this paper we deal with the offline handwriting text recognition (HTR) problem with reduced training datasets. Recent HTR solutions based on artificial neural networks exhibit remarkable solutions in referenced databases. These deep learning neural networks are composed of both convolutional (CNN) and long short-term memory recurrent units (LSTM). In addition, connectionist temporal classification (CTC) is the key to avoid segmentation at character level, greatly facilitating the labeling task. One of the main drawbacks of the CNNLSTM-CTC (CLC) solutions is that they need a considerable part of the text to be transcribed for every type of calligraphy, typically in the order of a few thousands of lines. Furthermore, in some scenarios the text to transcribe is not that long, e.g. in the Washington database. The CLC typically overfits for this reduced number of training samples. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
