Findings of the E2E NLG Challenge

Ond\v{r}ej Du\v{s}ek; Jekaterina Novikova; Verena Rieser

arXiv:1810.01170·cs.CL·November 22, 2018

Findings of the E2E NLG Challenge

Ond\v{r}ej Du\v{s}ek, Jekaterina Novikova, Verena Rieser

PDF

1 Repo 1 Datasets

TL;DR

This paper reports on the first shared task for end-to-end natural language generation in spoken dialogue systems, evaluating diverse approaches on a dataset with high lexical and syntactic complexity.

Contribution

It provides a comprehensive comparison of 62 systems from 17 institutions, assessing their ability to generate high-quality, diverse language output.

Findings

01

Seq2seq models perform competitively with rule-based systems.

02

Diverse approaches can handle complex discourse phenomena.

03

The shared task benchmarks current capabilities in end-to-end NLG.

Abstract

This paper summarises the experimental setup and results of the first shared task on end-to-end (E2E) natural language generation (NLG) in spoken dialogue systems. Recent end-to-end generation systems are promising since they reduce the need for data annotation. However, they are currently limited to small, delexicalised datasets. The E2E NLG shared task aims to assess whether these novel approaches can generate better-quality output by learning from a dataset containing higher lexical richness, syntactic complexity and diverse discourse phenomena. We compare 62 systems submitted by 17 institutions, covering a wide range of approaches, including machine learning architectures -- with the majority implementing sequence-to-sequence models (seq2seq) -- as well as systems based on grammatical rules and templates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UFAL-DSG/tgen
tf

Datasets

GEM/dart
dataset· 117 dl
117 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.