Neural Ctrl-F: Segmentation-free Query-by-String Word Spotting in Handwritten Manuscript Collections

Tomas Wilkinson; Jonas Lindström; Anders Brun

arXiv:1703.07645 · cs.CV · August 18, 2017 · 1 citation

Open Access · 1 Repo

TL;DR

This paper introduces Ctrl-F-Net, a deep neural network model for segmentation-free query-by-string word spotting in handwritten manuscripts, significantly improving search accuracy in historical documents.

Contribution

The paper presents an end-to-end trainable neural network model that outperforms existing segmentation-free methods for handwritten word spotting.

Findings

01 Outperforms previous state-of-the-art methods on benchmark datasets

02 Effective in challenging historical handwritten texts

03 Useful for real-world applications like historical research

Abstract

In this paper, we approach the problem of segmentation-free query-by-string word spotting for handwritten documents. In other words, we use methods inspired from computer vision and machine learning to search for words in large collections of digitized manuscripts. In particular, we are interested in historical handwritten texts, which are often far more challenging than modern printed documents. This task is important, as it provides people with a way to quickly find what they are looking for in large collections that are tedious and difficult to read manually. To this end, we introduce an end-to-end trainable model based on deep neural networks that we call Ctrl-F-Net. Given a full manuscript page, the model simultaneously generates region proposals, and embeds these into a distributed word embedding space, where searches are performed. We evaluate the model on common benchmarks for…
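The search step described in the abstract (embed the query string, then rank the embedded region proposals against it) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the character-count embedding, the cosine-similarity ranking, and the names `embed_string` and `search` are all assumptions made for the example. In the actual model, the proposal embeddings would be produced by the network from image regions, and the embedding space is not specified in the text above.

```python
import math
from typing import List, Sequence, Tuple

ALPHABET = "abcdefghijklmnopqrstuvwxyz"

# A region proposal box as (x1, y1, x2, y2) in page pixels (hypothetical layout).
Box = Tuple[int, int, int, int]


def embed_string(word: str) -> List[float]:
    """Toy stand-in embedding: per-letter counts over a-z.

    This is NOT the embedding used by Ctrl-F-Net; it only gives the
    query and the proposals a shared vector space for the sketch.
    """
    vec = [0.0] * len(ALPHABET)
    for ch in word.lower():
        idx = ALPHABET.find(ch)
        if idx >= 0:
            vec[idx] += 1.0
    return vec


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(
    query: str,
    proposals: Sequence[Tuple[Box, Sequence[float]]],
    top_k: int = 3,
) -> List[Tuple[float, Box]]:
    """Rank (box, embedding) proposals by similarity to the query string."""
    query_vec = embed_string(query)
    scored = [(cosine_similarity(query_vec, emb), box) for box, emb in proposals]
    # Highest similarity first; only the score is used as the sort key.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]
```

To try it, build a few proposals by hand, for example `[((0, 0, 10, 10), embed_string("letter")), ((20, 0, 30, 10), embed_string("camp"))]`, and call `search("letter", proposals)`. The point of the sketch is only that a text query and image regions are compared in one shared vector space, so no word segmentation of the page is needed at query time.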

Code & Models

Repositories

tomfalainen/neural-ctrlf
torch

Taxonomy

Topics: Handwritten Text Recognition Techniques · Natural Language Processing Techniques · Image Processing and 3D Reconstruction