Learning from Mistakes: Combining Ontologies via Self-Training for   Dialogue Generation

Lena Reed; Vrindavan Harrison; Shereen Oraby; Dilek Hakkani-Tur and; Marilyn Walker

arXiv:2010.00150·cs.CL·October 2, 2020

Learning from Mistakes: Combining Ontologies via Self-Training for Dialogue Generation

Lena Reed, Vrindavan Harrison, Shereen Oraby, Dilek Hakkani-Tur and, Marilyn Walker

PDF

TL;DR

This paper investigates training natural language generators for new, larger ontologies in dialogue systems by combining existing datasets and introducing a novel self-training method to improve performance.

Contribution

It introduces a self-training approach that enhances neural dialogue generation models when combining datasets for different ontologies, addressing data scarcity issues.

Findings

01

Self-training improved model performance by 75.4%.

02

The method achieved high naturalness and semantic coherence.

03

Baseline models struggled with combined ontologies, but self-training mitigated this.

Abstract

Natural language generators (NLGs) for task-oriented dialogue typically take a meaning representation (MR) as input. They are trained end-to-end with a corpus of MR/utterance pairs, where the MRs cover a specific set of dialogue acts and domain attributes. Creation of such datasets is labor-intensive and time-consuming. Therefore, dialogue systems for new domain ontologies would benefit from using data for pre-existing ontologies. Here we explore, for the first time, whether it is possible to train an NLG for a new larger ontology using existing training sets for the restaurant domain, where each set is based on a different ontology. We create a new, larger combined ontology, and then train an NLG to produce utterances covering it. For example, if one dataset has attributes for family-friendly and rating information, and the other has attributes for decor and service, our aim is an NLG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.