Evaluation of Table Representations to Answer Questions from Tables in Documents : A Case Study using 3GPP Specifications
Sujoy Roychowdhury, Sumit Soman, HG Ranjani, Avantika Sharma, Neeraj, Gunda, Sai Krishna Bala

TL;DR
This study evaluates different table representations for question answering in technical documents, specifically 3GPP specifications, finding that row-level representations with headers enhance retrieval accuracy.
Contribution
It systematically compares various table representations in a question answering context and demonstrates the effectiveness of row-level with header inclusion for technical documents.
Findings
Row-level representations improve retrieval performance.
Including table headers in each cell enhances accuracy.
Structural information in tables is crucial for QA tasks.
Abstract
With the ubiquitous use of document corpora for question answering, one important aspect which is especially relevant for technical documents is the ability to extract information from tables which are interspersed with text. The major challenge in this is that unlike free-flow text or isolated set of tables, the representation of a table in terms of what is a relevant chunk is not obvious. We conduct a series of experiments examining various representations of tabular data interspersed with text to understand the relative benefits of different representations. We choose a corpus of Generation Partnership Project (3GPP) documents since they are heavily interspersed with tables. We create expert curated dataset of question answers to evaluate our approach. We conclude that row level representations with corresponding table header information being included in every cell improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Service-Oriented Architecture and Web Services · Advanced Text Analysis Techniques
MethodsSparse Evolutionary Training
