One-shot Key Information Extraction from Document with Deep Partial Graph Matching
Minghong Yao, Zhiguang Liu, Liangwei Wang, Houqiang Li, Liansheng, Zhuang

TL;DR
This paper introduces a novel end-to-end deep learning approach for one-shot key information extraction from documents, effectively handling text shifts and utilizing multimodal features, supported by a new large dataset.
Contribution
It proposes a new deep partial graph matching network with an end-to-end training framework and a one-to-one constraint, along with a large, annotated dataset for one-shot KIE.
Findings
Achieved state-of-the-art performance on DKIE dataset.
Effectively handles shifted texts beyond previous methods.
Introduced the largest one-shot KIE dataset to date.
Abstract
Automating the Key Information Extraction (KIE) from documents improves efficiency, productivity, and security in many industrial scenarios such as rapid indexing and archiving. Many existing supervised learning methods for the KIE task need to feed a large number of labeled samples and learn separate models for different types of documents. However, collecting and labeling a large dataset is time-consuming and is not a user-friendly requirement for many cloud platforms. To overcome these challenges, we propose a deep end-to-end trainable network for one-shot KIE using partial graph matching. Contrary to previous methods that the learning of similarity and solving are optimized separately, our method enables the learning of the two processes in an end-to-end framework. Existing one-shot KIE methods are either template or simple attention-based learning approach that struggle to handle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Topic Modeling · Natural Language Processing Techniques
