Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction
Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi, Zheng, Yauwai Yim, Yangqiu Song

TL;DR
This paper introduces a new benchmark dataset and a novel pipeline for text-to-table generation that emphasizes information extraction, reasoning, and integration, showing improved performance over existing methods.
Contribution
The paper presents LiveSum, a new dataset for real-time commentary-based table generation, and proposes the T^3 pipeline to enhance LLM performance without explicit training.
Findings
LLMs struggle with text-to-table tasks even after fine-tuning.
The T^3 pipeline significantly improves performance without training.
The approach generalizes well to other datasets.
Abstract
The task of condensing large chunks of textual information into concise and structured tables has gained attention recently due to the emergence of Large Language Models (LLMs) and their potential benefit for downstream tasks, such as text summarization and text mining. Previous approaches often generate tables that directly replicate information from the text, limiting their applicability in broader contexts, as text-to-table generation in real-life scenarios necessitates information extraction, reasoning, and integration. However, there is a lack of both datasets and methodologies towards this task. In this paper, we introduce LiveSum, a new benchmark dataset created for generating summary tables of competitions based on real-time commentary texts. We evaluate the performances of state-of-the-art LLMs on this task in both fine-tuning and zero-shot settings, and additionally propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMathematics, Computing, and Information Processing · Natural Language Processing Techniques · Advanced Text Analysis Techniques
