Workshop on Document Intelligence Understanding

Soyeon Caren Han; Yihao Ding; Siwen Luo; Josiah Poon; HeeGuen Yoon,; Zhe Huang; Paul Duuring; Eun Jung Holden

arXiv:2307.16369·cs.IR·August 1, 2023

Workshop on Document Intelligence Understanding

Soyeon Caren Han, Yihao Ding, Siwen Luo, Josiah Poon, HeeGuen Yoon,, Zhe Huang, Paul Duuring, Eun Jung Holden

PDF

Open Access

TL;DR

This workshop focuses on advancing document understanding and information extraction across various domains, emphasizing full document comprehension through a new dataset and challenge to improve multi-page analysis techniques.

Contribution

It introduces the PDFVQA dataset and challenge, promoting research on full document-level understanding beyond single-page analysis.

Findings

01

Development of the PDFVQA dataset for full document understanding

02

Introduction of a challenge to evaluate models on multi-page document questions

03

Enhanced techniques for structural and contextual comprehension of documents

Abstract

Document understanding and information extraction include different tasks to understand a document and extract valuable information automatically. Recently, there has been a rising demand for developing document understanding among different domains, including business, law, and medicine, to boost the efficiency of work that is associated with a large number of documents. This workshop aims to bring together researchers and industry developers in the field of document intelligence and understanding diverse document types to boost automatic document processing and understanding techniques. We also released a data challenge on the recently introduced document-level VQA dataset, PDFVQA. The PDFVQA challenge examines the structural and contextual understandings of proposed models on the natural full document level of multiple consecutive document pages by including questions with a sequence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Edcuational Technology Systems