DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents
Thomas Constum, Pierrick Tranouez, Thierry Paquet

TL;DR
DANIEL is an end-to-end neural architecture that integrates layout analysis, handwriting recognition, and named entity recognition for handwritten documents, achieving state-of-the-art results and faster processing times.
Contribution
It introduces DANIEL, a novel fully end-to-end model combining a convolutional encoder and transformer decoder for comprehensive handwritten document understanding.
Findings
State-of-the-art performance on RIMES 2009 and M-POPP datasets.
Faster processing compared to existing methods.
Effective multi-language and multi-task learning capability.
Abstract
Information extraction from handwritten documents involves traditionally three distinct steps: Document Layout Analysis, Handwritten Text Recognition, and Named Entity Recognition. Recent approaches have attempted to integrate these steps into a single process using fully end-to-end architectures. Despite this, these integrated approaches have not yet matched the performance of language models, when applied to information extraction in plain text. In this paper, we introduce DANIEL (Document Attention Network for Information Extraction and Labelling), a fully end-to-end architecture integrating a language model and designed for comprehensive handwritten document understanding. DANIEL performs layout recognition, handwriting recognition, and named entity recognition on full-page documents. Moreover, it can simultaneously learn across multiple languages, layouts, and tasks. For named…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Natural Language Processing Techniques
MethodsSoftmax · Attention Is All You Need · Ontology
