Revising FUNSD dataset for key-value detection in document images

Hieu M. Vu; Diep Thi-Ngoc Nguyen

arXiv:2010.05322 · cs.CV · October 13, 2020

TL;DR

This paper revises the FUNSD dataset for key-value detection in document images by addressing labeling inconsistencies and demonstrates baseline and improved models for key-value extraction.

Contribution

It identifies labeling issues in FUNSD and provides a revised version along with baseline and enhanced models for key-value detection.

Findings

1. Revised FUNSD dataset with corrected labels.
2. Baseline UNet model for key-value detection.
3. Improved UNet with Channel-Invariant Deformable Convolution.

Abstract

FUNSD is one of the few publicly available datasets for information extraction from document images. The information in the FUNSD dataset is defined by text areas of four categories ("key", "value", "header", "other") plus "background", and by connectivity between areas as key-value relations. Inspecting FUNSD, we found several labeling inconsistencies, which impeded its applicability to the key-value extraction problem. In this report, we describe some labeling issues in FUNSD and the revision we made to the dataset. We also report our implementation of key-value detection on FUNSD using a UNet model as baseline results and an improved UNet model with Channel-Invariant Deformable Convolution.
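The annotation structure described in the abstract (labeled text areas plus links between them) can be illustrated with a minimal Python sketch. This is not the authors' code: the `TextArea` class, its field names, and the `key_value_pairs` helper are hypothetical, modeled only loosely on the per-area `id`, `text`, `box`, `label`, and `linking` fields of FUNSD-style JSON annotations. The label strings follow the abstract's wording ("key", "value", "header", "other"), which may differ from the strings stored in the dataset files.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TextArea:
    """One annotated text area (hypothetical schema, for illustration only)."""
    id: int
    text: str
    box: Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels, assumed layout
    label: str                      # "key", "value", "header", or "other"
    # Each link is a (source_id, target_id) pair; assumed to point key -> value.
    linking: List[Tuple[int, int]] = field(default_factory=list)


def key_value_pairs(areas: List[TextArea]) -> List[Tuple[str, str]]:
    """Collect (key text, value text) pairs from the links between areas."""
    by_id = {a.id: a for a in areas}
    seen = set()  # a link may be stored on both endpoints, so deduplicate
    for area in areas:
        for src, dst in area.linking:
            s, d = by_id.get(src), by_id.get(dst)
            if s is None or d is None:
                continue  # dangling link: one endpoint is missing
            if s.label == "key" and d.label == "value":
                seen.add((s.id, d.id))
    return [(by_id[k].text, by_id[v].text) for k, v in sorted(seen)]


# Made-up example areas, not taken from the dataset.
areas = [
    TextArea(0, "Date:", (10, 10, 60, 25), "key", [(0, 1)]),
    TextArea(1, "10/13/2020", (70, 10, 150, 25), "value", [(0, 1)]),
    TextArea(2, "FAX COVER SHEET", (10, 40, 200, 60), "header"),
    TextArea(3, "Page 1", (10, 70, 60, 85), "other"),
]
pairs = key_value_pairs(areas)
```

Filtering links by the labels of their endpoints is one simple way to surface the kind of inconsistency the abstract mentions: a link whose endpoints are not a key and a value is silently skipped here, but could instead be logged for manual review.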

Taxonomy

Topics: Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques

Methods: Convolution