Recognizing Challenging Handwritten Annotations with Fully Convolutional   Networks

Andreas K\"olsch; Ashutosh Mishra; Saurabh Varshneya; Muhammad Zeshan; Afzal; Marcus Liwicki

arXiv:1804.00236·cs.CV·December 7, 2018

Recognizing Challenging Handwritten Annotations with Fully Convolutional Networks

Andreas K\"olsch, Ashutosh Mishra, Saurabh Varshneya, Muhammad Zeshan, Afzal, Marcus Liwicki

PDF

TL;DR

This paper presents a new dataset of historic German documents with challenging handwritten annotations and evaluates Fully Convolutional Neural Networks for pixel-wise annotation recognition, achieving high accuracy.

Contribution

It introduces a novel dataset for challenging handwritten annotation recognition and assesses FCNN-based methods, providing insights into effective training strategies.

Findings

01

Best model achieves 95.6% IoU score

02

Comparison of data augmentation strategies

03

Evaluation on challenging historic documents

Abstract

This paper introduces a very challenging dataset of historic German documents and evaluates Fully Convolutional Neural Network (FCNN) based methods to locate handwritten annotations of any kind in these documents. The handwritten annotations can appear in form of underlines and text by using various writing instruments, e.g., the use of pencils makes the data more challenging. We train and evaluate various end-to-end semantic segmentation approaches and report the results. The task is to classify the pixels of documents into two classes: background and handwritten annotation. The best model achieves a mean Intersection over Union (IoU) score of 95.6% on the test documents of the presented dataset. We also present a comparison of different strategies used for data augmentation and training on our presented dataset. For evaluation, we use the Layout Analysis Evaluator for the ICDAR 2017…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.