DocTabQA: Answering Questions from Long Documents Using Tables
Haochen Wang, Kai Hu, Haoyu Dong, and Liangcai Gao

TL;DR
This paper introduces the novel DocTabQA task, which involves generating structured tables as answers from long documents, and proposes a two-stage framework with technological innovations to improve table generation from lengthy texts.
Contribution
The paper presents the first dataset and framework for DocTabQA, enabling structured table answers from long documents, with innovations tailored for large language models.
Findings
Significant performance improvements of GPT-4 with DocTabTalk.
Effective retrieval and hierarchical table generation from long documents.
Enhanced clarity and organization in answer tables.
Abstract
We study a new problem setting of question answering (QA), referred to as DocTabQA. Within this setting, given a long document, the goal is to respond to questions by organizing the answers into structured tables derived directly from the document's content. Unlike traditional QA approaches which predominantly rely on unstructured text to formulate responses, DocTabQA aims to leverage structured tables as answers to convey information clearly and systematically, thereby enhancing user comprehension and highlighting relationships between data points. To the best of our knowledge, this problem has not been previously explored. In this paper, we introduce the QTabA dataset, encompassing 300 financial documents, accompanied by manually annotated 1.5k question-table pairs. Initially, we leverage Large Language Models (LLMs) such as GPT-4 to establish a baseline. However, it is widely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Absolute Position Encodings
