# Attend, Copy, Parse -- End-to-end information extraction from documents

**Authors:** Rasmus Berg Palm, Florian Laws, Ole Winther

arXiv: 1812.07248 · 2021-04-26

## TL;DR

This paper introduces the Attend, Copy, Parse neural network architecture that enables end-to-end document information extraction directly from raw data, bypassing the need for costly word-level annotations.

## Contribution

The proposed model allows training on end-to-end document data, outperforming existing word classification methods and broadening applicability to real-world tasks without detailed labels.

## Key findings

- Outperforms state-of-the-art production systems on invoice data
- Capable of training directly on end-to-end document data
- Reduces reliance on expensive word-level annotations

## Abstract

Document information extraction tasks performed by humans create data consisting of a PDF or document image input, and extracted string outputs. This end-to-end data is naturally consumed and produced when performing the task because it is valuable in and of itself. It is naturally available, at no additional cost. Unfortunately, state-of-the-art word classification methods for information extraction cannot use this data, instead requiring word-level labels which are expensive to create and consequently not available for many real life tasks. In this paper we propose the Attend, Copy, Parse architecture, a deep neural network model that can be trained directly on end-to-end data, bypassing the need for word-level labels. We evaluate the proposed architecture on a large diverse set of invoices, and outperform a state-of-the-art production system based on word classification. We believe our proposed architecture can be used on many real life information extraction tasks where word classification cannot be used due to a lack of the required word-level labels.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.07248/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1812.07248/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1812.07248/full.md

---
Source: https://tomesphere.com/paper/1812.07248