TL;DR
This paper introduces a large dataset and novel neural network architectures, including siamese networks and similarity-based methods, to significantly improve information extraction accuracy from structured documents.
Contribution
It presents a new large-scale dataset and explores innovative deep learning architectures that leverage similarity and context to enhance document information extraction.
Findings
Best model improves F1 score by 8.25 points over previous state-of-the-art.
Using similarity and context-aware architectures enhances extraction performance.
All proposed architectural components are necessary to achieve the best results.
Abstract
The automation of document processing is gaining recent attention due to the great potential to reduce manual work through improved methods and hardware. Neural networks have been successfully applied before - even though they have been trained only on relatively small datasets with hundreds of documents so far. To successfully explore deep learning techniques and improve the information extraction results, a dataset with more than twenty-five thousand documents has been compiled, anonymized and is published as a part of this work. We will expand our previous work where we proved that convolutions, graph convolutions and self-attention can work together and exploit all the information present in a structured document. Taking the fully trainable method one step further, we will now design and examine various approaches to using siamese networks, concepts of similarity, one-shot learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
