Attribute CNNs for Word Spotting in Handwritten Documents

Sebastian Sudholt; Gernot Fink

arXiv:1712.07487·cs.CV·December 21, 2017

Attribute CNNs for Word Spotting in Handwritten Documents

Sebastian Sudholt, Gernot Fink

PDF

TL;DR

This paper introduces Attribute CNNs for handwritten word spotting, achieving state-of-the-art results by learning attribute representations with CNNs and end-to-end training.

Contribution

It presents novel CNN architectures and loss functions for attribute-based word spotting, advancing beyond previous SVM-based methods.

Findings

01

Achieved state-of-the-art segmentation-based word spotting results

02

Demonstrated effectiveness of end-to-end CNN training

03

Compared different word string embeddings and optimization strategies

Abstract

Word spotting has become a field of strong research interest in document image analysis over the last years. Recently, AttributeSVMs were proposed which predict a binary attribute representation. At their time, this influential method defined the state-of-the-art in segmentation-based word spotting. In this work, we present an approach for learning attribute representations with Convolutional Neural Networks (CNNs). By taking a probabilistic perspective on training CNNs, we derive two different loss functions for binary and real-valued word string embeddings. In addition, we propose two different CNN architectures, specifically designed for word spotting. These architectures are able to be trained in an end-to-end fashion. In a number of experiments, we investigate the influence of different word string embeddings and optimization strategies. We show our Attribute CNNs to achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.