Field typing for improved recognition on heterogeneous handwritten forms

Ciprian Tomoiaga (1); Paul Feng (1); Mathieu Salzmann (2); Patrick; Jayet (1) ((1) AXA REV Lausanne; (2) CVLab EPFL Switzerland)

arXiv:1909.10120·cs.CV·September 24, 2019

Field typing for improved recognition on heterogeneous handwritten forms

Ciprian Tomoiaga (1), Paul Feng (1), Mathieu Salzmann (2), Patrick, Jayet (1) ((1) AXA REV Lausanne, (2) CVLab EPFL Switzerland)

PDF

1 Repo

TL;DR

This paper improves handwritten form recognition by incorporating field typing into LSTM models and generating synthetic training data, addressing real-world heterogeneity and ambiguity.

Contribution

It introduces a field typing method within an LSTM architecture and a synthetic data generation procedure for better recognition of heterogeneous handwritten forms.

Findings

01

Enhanced recognition accuracy on real-world forms

02

Effective use of synthetic data for training

03

Improved generalization to diverse handwriting styles

Abstract

Offline handwriting recognition has undergone continuous progress over the past decades. However, existing methods are typically benchmarked on free-form text datasets that are biased towards good-quality images and handwriting styles, and homogeneous content. In this paper, we show that state-of-the-art algorithms, employing long short-term memory (LSTM) layers, do not readily generalize to real-world structured documents, such as forms, due to their highly heterogeneous and out-of-vocabulary content, and to the inherent ambiguities of this content. To address this, we propose to leverage the content type within an LSTM-based architecture. Furthermore, we introduce a procedure to generate synthetic data to train this architecture without requiring expensive manual annotations. We demonstrate the effectiveness of our approach at transcribing text on a challenging, real-world dataset of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cipri-tom/type-aware-crnn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.