A large-scale field test on word-image classification in large historical document collections using a traditional and two deep-learning methods

Lambert Schomaker

arXiv:1904.08421 · cs.CV · April 19, 2019 · 1 citation

Open Access

TL;DR

This study evaluates traditional and deep-learning methods for word-image classification in large historical manuscript collections, revealing limitations of deep learning and the robustness of traditional approaches in this context.

Contribution

It provides a large-scale practical assessment of classification methods on diverse handwritten historical documents, highlighting the challenges and potential of traditional versus deep-learning techniques.

Findings

01

The traditional BOVW (bag of visual words) feature method performs consistently across numbers of classes and numbers of training examples (mean accuracy 87%).

02

In deep-learning tests with more than 1000 one-hot output units (lexical words), accuracy dropped steeply to almost zero percent.

03

End-to-end CNN training achieved about 95% accuracy when the problematic books are excluded.

Abstract

This technical report describes a practical field test on word-image classification in a very large collection of more than 300 diverse handwritten historical manuscripts, with 1.6 million unique labeled images and more than 11 million images used in testing. Results indicate that several deep-learning tests completely failed (mean accuracy 83%). In the tests with more than 1000 output units (lexical words) in one-hot encoding for classification, performance steeply drops to almost zero percent accuracy, even with a modest size of the pre-final (i.e., penultimate) layer (150 units). A traditional feature method (BOVW) displays a consistent performance over numbers of classes and numbers of training examples (mean accuracy 87%). Additional tests using nearest mean on the output of the pre-final layer of an Inception V3 network, for each book, only yielded mediocre results (mean accuracy…
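The nearest-mean test mentioned in the abstract can be illustrated with a minimal sketch: each class is represented by the mean of its training feature vectors, and a query is assigned to the class with the closest mean. The abstract does not state the distance measure or the feature dimensionality, so the Euclidean distance, the toy 2-D vectors (standing in for penultimate-layer activations), the word labels, and the function names `fit_class_means` and `predict_nearest_mean` are all assumptions made for illustration. In the report the procedure is applied per book, which would correspond to one such set of means per book.

```python
from collections import defaultdict
from math import dist


def fit_class_means(features, labels):
    """Return {label: mean feature vector} computed from labeled training vectors."""
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        # Accumulate the component-wise sum for this class.
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}


def predict_nearest_mean(class_means, vec):
    """Return the label whose mean vector is closest to vec (Euclidean distance, an assumption)."""
    return min(class_means, key=lambda label: dist(class_means[label], vec))


# Hypothetical toy data: 2-D stand-ins for network features, with made-up word labels.
train_features = [[0.0, 0.1], [0.2, 0.0], [1.0, 0.9], [0.9, 1.1]]
train_labels = ["ende", "ende", "heer", "heer"]

means = fit_class_means(train_features, train_labels)
prediction = predict_nearest_mean(means, [0.8, 1.0])
```

Because the classifier stores only one mean vector per class, adding classes adds no trainable output units, unlike a one-hot output layer that grows with the lexicon.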

Taxonomy

Topics

Handwritten Text Recognition Techniques