Data-to-Text Generation with Content Selection and Planning

Ratish Puduppully; Li Dong; Mirella Lapata

arXiv:1809.00582·cs.CL·April 15, 2019

Data-to-Text Generation with Content Selection and Planning

Ratish Puduppully, Li Dong, Mirella Lapata

PDF

2 Repos

TL;DR

This paper introduces a neural network model for data-to-text generation that explicitly incorporates content selection and planning, leading to improved performance over existing end-to-end models.

Contribution

It proposes a two-stage neural architecture that separates content planning from text generation, enhancing control and output quality in data-to-text tasks.

Findings

01

Outperforms strong baselines on RotoWire dataset

02

Improves state-of-the-art results

03

Both automatic and human evaluations confirm effectiveness

Abstract

Recent advances in data-to-text generation have led to the use of large-scale datasets and neural network models which are trained end-to-end, without explicitly modeling what to say and in what order. In this work, we present a neural network architecture which incorporates content selection and planning without sacrificing end-to-end training. We decompose the generation task into two stages. Given a corpus of data records (paired with descriptive documents), we first generate a content plan highlighting which information should be mentioned and in which order and then generate the document while taking the content plan into account. Automatic and human-based evaluation experiments show that our model outperforms strong baselines improving the state-of-the-art on the recently released RotoWire dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.