Towards Question Answering over Large Semi-structured Tables
Yuxiang Wang, Junhao Gan, Jianzhong Qi

TL;DR
This paper introduces TaDRe, a novel TableQA model that improves accuracy on large semi-structured tables by refining table decomposition, validated on new and existing benchmarks with state-of-the-art results.
Contribution
The paper proposes TaDRe, a new TableQA approach with decomposition refinement, addressing errors in previous methods and enhancing performance on large tables.
Findings
TaDRe achieves state-of-the-art results on large-table TableQA benchmarks.
Constructed two new large-table TableQA benchmarks using LLM-driven data generation.
Extensive experiments demonstrate the effectiveness of table decomposition refinement.
Abstract
Table Question Answering (TableQA) attracts strong interests due to the prevalence of web information presented in the form of semi-structured tables. Despite many efforts, TableQA over large tables remains an open challenge. This is because large tables may overwhelm models that try to comprehend them in full to locate question answers. Recent studies reduce input table size by decomposing tables into smaller, question-relevant sub-tables via generating programs to parse the tables. However, such solutions are subject to program generation and execution errors and are difficult to ensure decomposition quality. To address this issue, we propose TaDRe, a TableQA model that incorporates both pre- and post-table decomposition refinements to ensure table decomposition quality, hence achieving highly accurate TableQA results. To evaluate TaDRe, we construct two new large-table TableQA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Mathematics, Computing, and Information Processing
