A Hierarchical Model for Data-to-Text Generation

Cl\'ement Rebuffel; Laure Soulier; Geoffrey Scoutheeten and; Patrick Gallinari

arXiv:1912.10011·cs.CL·December 23, 2019

A Hierarchical Model for Data-to-Text Generation

Cl\'ement Rebuffel, Laure Soulier, Geoffrey Scoutheeten and, Patrick Gallinari

PDF

1 Repo

TL;DR

This paper introduces a hierarchical data-to-text generation model that better preserves data structure during translation, improving quality over traditional linearization methods.

Contribution

It presents a novel hierarchical approach encoding data at multiple levels, surpassing sequence linearization limitations in data-to-text tasks.

Findings

01

Outperforms baseline models on RotoWire dataset

02

Improves both qualitative and quantitative metrics

03

Effectively captures data structure in generated text

Abstract

Transcribing structured data into natural language descriptions has emerged as a challenging task, referred to as "data-to-text". These structures generally regroup multiple elements, as well as their attributes. Most attempts rely on translation encoder-decoder methods which linearize elements into a sequence. This however loses most of the structure contained in the data. In this work, we propose to overpass this limitation with a hierarchical model that encodes the data-structure at the element-level and the structure level. Evaluations on RotoWire show the effectiveness of our model w.r.t. qualitative and quantitative metrics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KaijuML/data-to-text-hierarchical
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.