Document Intelligence Metrics for Visually Rich Document Evaluation

Jonathan DeGange; Swapnil Gupta; Zhuoyu Han; Krzysztof Wilkosz; Adam Karwan

arXiv:2205.11215 · cs.AI · May 24, 2022

TL;DR

This paper introduces DI-Metrics, a Python library for evaluating Visually-Rich Document models using diverse metrics, and demonstrates its application on the CORD dataset to compare state-of-the-art models.

Contribution

The paper presents DI-Metrics, a comprehensive open-source evaluation library for VRD models, incorporating text, geometric, and hierarchical metrics.

Findings

01. DI-Metrics evaluates VRD models with text-based, geometric, and hierarchical metrics.

02. The library is applied to the CORD dataset to compare three SOTA models and one industry model.

03. The open-source library is available on GitHub.

Abstract

The processing of Visually-Rich Documents (VRDs) is highly important in information extraction tasks associated with Document Intelligence. We introduce DI-Metrics, a Python library devoted to VRD model evaluation, comprising text-based, geometric, and hierarchical metrics for information extraction tasks. We apply DI-Metrics to evaluate information extraction performance on the publicly available CORD dataset, comparing the performance of three SOTA models and one industry model. The open-source library is available on GitHub.
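
To make the metric families named in the abstract more concrete, the sketch below illustrates one common text-based measure (normalized Levenshtein similarity) and one common geometric measure (intersection over union of bounding boxes). It is not the DI-Metrics API: the function names `levenshtein`, `text_similarity`, and `iou`, and the `(x_min, y_min, x_max, y_max)` box layout, are assumptions chosen for illustration, and the library may define or normalize its metrics differently. Hierarchical metrics are not sketched here.

```python
from typing import Tuple

# Hypothetical box layout for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def levenshtein(a: str, b: str) -> int:
    """Edit distance where insert, delete, and substitute each cost 1."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # delete ch_a
                    current[j - 1] + 1,      # insert ch_b
                    previous[j - 1] + cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]


def text_similarity(pred: str, truth: str) -> float:
    """Text-based score in [0, 1]; 1.0 means the strings are identical."""
    longest = max(len(pred), len(truth))
    if longest == 0:
        return 1.0  # two empty strings are treated as a perfect match
    return 1.0 - levenshtein(pred, truth) / longest


def iou(a: Box, b: Box) -> float:
    """Geometric score in [0, 1]: overlap area divided by union area."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # A predicted receipt field compared with its ground truth (made-up values).
    print(text_similarity("12,500", "12.500"))
    print(iou((0, 0, 2, 2), (1, 1, 3, 3)))
```

A text-based score of this kind rewards near-miss OCR output that an exact-match comparison would count as a complete failure, while the geometric score checks whether the predicted field sits in the right place on the page regardless of its text.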

Code & Models

Repositories

metricsdi/dimetrics (official)

Taxonomy

Topics: Digital Humanities and Scholarship · Mathematics, Computing, and Information Processing · Semantic Web and Ontologies