DocTabQA: Answering Questions from Long Documents Using Tables

Haochen Wang; Kai Hu; Haoyu Dong; and Liangcai Gao

arXiv:2408.11490·cs.CL·August 22, 2024

DocTabQA: Answering Questions from Long Documents Using Tables

Haochen Wang, Kai Hu, Haoyu Dong, and Liangcai Gao

PDF

Open Access 1 Repo

TL;DR

This paper introduces the novel DocTabQA task, which involves generating structured tables as answers from long documents, and proposes a two-stage framework with technological innovations to improve table generation from lengthy texts.

Contribution

The paper presents the first dataset and framework for DocTabQA, enabling structured table answers from long documents, with innovations tailored for large language models.

Findings

01

Significant performance improvements of GPT-4 with DocTabTalk.

02

Effective retrieval and hierarchical table generation from long documents.

03

Enhanced clarity and organization in answer tables.

Abstract

We study a new problem setting of question answering (QA), referred to as DocTabQA. Within this setting, given a long document, the goal is to respond to questions by organizing the answers into structured tables derived directly from the document's content. Unlike traditional QA approaches which predominantly rely on unstructured text to formulate responses, DocTabQA aims to leverage structured tables as answers to convey information clearly and systematically, thereby enhancing user comprehension and highlighting relationships between data points. To the best of our knowledge, this problem has not been previously explored. In this paper, we introduce the QTabA dataset, encompassing 300 financial documents, accompanied by manually annotated 1.5k question-table pairs. Initially, we leverage Large Language Models (LLMs) such as GPT-4 to establish a baseline. However, it is widely…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

smilewhc/doctabqa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Absolute Position Encodings