Summarizing and Exploring Tabular Data in Conversational Search

Shuo Zhang; Zhuyun Dai; Krisztian Balog; Jamie Callan

arXiv:2005.11490 · cs.IR · July 13, 2020

TL;DR

This paper introduces a new dataset and baseline methods for generating natural language summaries of tables, so that conversational search systems can answer a query and help users explore a result table without reciting it in full.

Contribution

The paper presents a new crowdsourced, conversation-oriented, open-domain dataset for table summarization and uses it to develop baseline systems for automatic summarization.

Findings

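01 A crowdsourced dataset of annotated table summaries was created; the summaries answer questions and help people explore other information in the table.

02 Automatic table summarization systems were developed on this dataset to serve as baselines.

03 Based on the experimental results, the paper identifies challenges and future research directions in table summarization.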

Abstract

Tabular data provide answers to a significant portion of search queries. However, reciting an entire result table is impractical in conversational search systems. We propose to generate natural language summaries as answers to describe the complex information contained in a table. Through crowdsourcing experiments, we build a new conversation-oriented, open-domain table summarization dataset. It includes annotated table summaries, which not only answer questions but also help people explore other information in the table. We utilize this dataset to develop automatic table summarization systems as SOTA baselines. Based on the experimental results, we identify challenges and point out future research directions that this resource will support.

Code & Models

Repositories

iai-group/sigir2020-tablesum (official)