One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction
Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin

TL;DR
This paper introduces a novel end-to-end deep learning approach for one-shot text field labeling in document images, leveraging attention mechanisms and belief propagation to improve accuracy and efficiency across diverse document types.
Contribution
The paper proposes a new deep model combining attention and conditional random fields for one-shot text field labeling, addressing limitations of rule-based methods and existing learning approaches.
Findings
Effective in crowded regions with few landmarks
Outperforms rule-based and existing one-shot methods
Generalizes well across various document types
Abstract
Structured information extraction from document images usually consists of three steps: text detection, text recognition, and text field labeling. While text detection and text recognition have been heavily studied and improved a lot in literature, text field labeling is less explored and still faces many challenges. Existing learning based methods for text labeling task usually require a large amount of labeled examples to train a specific model for each type of document. However, collecting large amounts of document images and labeling them is difficult and sometimes impossible due to privacy issues. Deploying separate models for each type of document also consumes a lot of resources. Facing these challenges, we explore one-shot learning for the text field labeling task. Existing one-shot learning methods for the task are mostly rule-based and have difficulty in labeling fields in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
