Recognizing Challenging Handwritten Annotations with Fully Convolutional Networks
Andreas K\"olsch, Ashutosh Mishra, Saurabh Varshneya, Muhammad Zeshan, Afzal, Marcus Liwicki

TL;DR
This paper presents a new dataset of historic German documents with challenging handwritten annotations and evaluates Fully Convolutional Neural Networks for pixel-wise annotation recognition, achieving high accuracy.
Contribution
It introduces a novel dataset for challenging handwritten annotation recognition and assesses FCNN-based methods, providing insights into effective training strategies.
Findings
Best model achieves 95.6% IoU score
Comparison of data augmentation strategies
Evaluation on challenging historic documents
Abstract
This paper introduces a very challenging dataset of historic German documents and evaluates Fully Convolutional Neural Network (FCNN) based methods to locate handwritten annotations of any kind in these documents. The handwritten annotations can appear in form of underlines and text by using various writing instruments, e.g., the use of pencils makes the data more challenging. We train and evaluate various end-to-end semantic segmentation approaches and report the results. The task is to classify the pixels of documents into two classes: background and handwritten annotation. The best model achieves a mean Intersection over Union (IoU) score of 95.6% on the test documents of the presented dataset. We also present a comparison of different strategies used for data augmentation and training on our presented dataset. For evaluation, we use the Layout Analysis Evaluator for the ICDAR 2017…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
