A large-scale field test on word-image classification in large historical document collections using a traditional and two deep-learning methods
Lambert Schomaker

TL;DR
This study evaluates traditional and deep-learning methods for word-image classification in large historical manuscript collections, revealing limitations of deep learning and the robustness of traditional approaches in this context.
Contribution
It provides a large-scale practical assessment of classification methods on diverse handwritten historical documents, highlighting the challenges and potential of traditional versus deep-learning techniques.
Findings
Traditional BOVW method maintains 87% accuracy across classes.
Deep learning methods failed to perform well with high class counts.
End-to-end CNN achieved about 95% accuracy when problematic books are excluded.
Abstract
This technical report describes a practical field test on word-image classification in a very large collection of more than 300 diverse handwritten historical manuscripts, with 1.6 million unique labeled images and more than 11 million images used in testing. Results indicate that several deep-learning tests completely failed (mean accuracy 83%). In the tests with more than 1000 output units (lexical words) in one-hot encoding for classification, performance steeply drops to almost zero percent accuracy, even with a modest size of the pre-final (i.e., penultimate) layer (150 units). A traditional feature method (BOVW) displays a consistent performance over numbers of classes and numbers of training examples (mean accuracy 87%). Additional tests using nearest mean on the output of the pre-final layer of an Inception V3 network, for each book, only yielded mediocre results (mean accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques
