KET-QA: A Dataset for Knowledge Enhanced Table Question Answering

Mengkang Hu; Haoyu Dong; Ping Luo; Shi Han; Dongmei Zhang

arXiv:2405.08099·cs.CL·May 15, 2024

KET-QA: A Dataset for Knowledge Enhanced Table Question Answering

Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han, Dongmei Zhang

PDF

Open Access

TL;DR

KET-QA introduces a knowledge base-enhanced dataset for table question answering, emphasizing the integration of external structured knowledge to improve answer accuracy, with a novel retrieval-reasoning model demonstrating significant performance gains.

Contribution

This work creates the first dataset linking tables with sub-graph knowledge bases for QA and proposes a retrieval-reasoner pipeline to leverage external knowledge effectively.

Findings

01

Model improves EM scores by 1.9 to 6.5 times over baselines.

02

Achieves up to 44.64% absolute EM score improvement.

03

Even the best model lags behind human performance, indicating challenges.

Abstract

Due to the concise and structured nature of tables, the knowledge contained therein may be incomplete or missing, posing a significant challenge for table question answering (TableQA) and data analysis systems. Most existing datasets either fail to address the issue of external knowledge in TableQA or only utilize unstructured text as supplementary information for tables. In this paper, we propose to use a knowledge base (KB) as the external knowledge source for TableQA and construct a dataset KET-QA with fine-grained gold evidence annotation. Each table in the dataset corresponds to a sub-graph of the entire KB, and every question requires the integration of information from both the table and the sub-graph to be answered. To extract pertinent information from the vast knowledge sub-graph and apply it to TableQA, we design a retriever-reasoner structured pipeline model. Experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsBalanced Selection