One-shot Text Field Labeling using Attention and Belief Propagation for   Structure Information Extraction

Mengli Cheng; Minghui Qiu; Xing Shi; Jun Huang; Wei Lin

arXiv:2009.04153·cs.CV·September 10, 2020

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel end-to-end deep learning approach for one-shot text field labeling in document images, leveraging attention mechanisms and belief propagation to improve accuracy and efficiency across diverse document types.

Contribution

The paper proposes a new deep model combining attention and conditional random fields for one-shot text field labeling, addressing limitations of rule-based methods and existing learning approaches.

Findings

01

Effective in crowded regions with few landmarks

02

Outperforms rule-based and existing one-shot methods

03

Generalizes well across various document types

Abstract

Structured information extraction from document images usually consists of three steps: text detection, text recognition, and text field labeling. While text detection and text recognition have been heavily studied and improved a lot in literature, text field labeling is less explored and still faces many challenges. Existing learning based methods for text labeling task usually require a large amount of labeled examples to train a specific model for each type of document. However, collecting large amounts of document images and labeling them is difficult and sometimes impossible due to privacy issues. Deploying separate models for each type of document also consumes a lot of resources. Facing these challenges, we explore one-shot learning for the text field labeling task. Existing one-shot learning methods for the task are mostly rule-based and have difficulty in labeling fields in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AlibabaPAI/one_shot_text_labeling
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques