A Good Sample is Hard to Find: Noise Injection Sampling and Self-Training for Neural Language Generation Models

Chris Kedzie; Kathleen McKeown

arXiv:1911.03373 · cs.CL · November 11, 2019

TL;DR

This paper introduces a self-training approach with noise injection sampling to improve neural language generation models, enabling them to produce more semantically accurate utterances for unseen inputs.

Contribution

It proposes an architecture-agnostic self-training method that augments the original training data with novel MR/utterance pairs obtained by noise injection sampling, improving the semantic fidelity of generated utterances.

Findings

01. Models trained on the augmented data produce more semantically correct utterances.

02. Simple encoder-decoder models with greedy decoding match state-of-the-art outputs after training on the augmented data.

03. The resulting utterances are as good as state-of-the-art outputs in both automatic and human evaluations of quality.

Abstract

Deep neural networks (DNNs) are quickly becoming the de facto standard modeling method for many natural language generation (NLG) tasks. In order for such models to truly be useful, they must be capable of correctly generating utterances for novel meaning representations (MRs) at test time. In practice, even sophisticated DNNs with various forms of semantic control frequently fail to generate utterances faithful to the input MR. In this paper, we propose an architecture-agnostic self-training method to sample novel MR/text utterance pairs to augment the original training data. Remarkably, after training on the augmented data, even simple encoder-decoder models with greedy decoding are capable of generating semantically correct utterances that are as good as state-of-the-art outputs in both automatic and human evaluations of quality.
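
The augmentation loop described in the abstract can be illustrated with a toy sketch. This page does not say where the noise is injected or how sampled utterances are matched to MRs, so the sketch makes two assumptions: Gaussian noise perturbs the per-step token scores before a greedy argmax, and a hypothetical keyword-based parser (`parse_mr`) labels each sample with the MR it expresses. `STEP_SCORES`, `KEYWORDS`, and all function names are invented for illustration and are not taken from the paper or its repository.

```python
import random

# Hypothetical keyword table: token -> (slot, value) it expresses.
KEYWORDS = {
    "cheap": ("price", "cheap"),
    "pricey": ("price", "expensive"),
    "italian": ("food", "italian"),
    "french": ("food", "french"),
}

# Stand-in for a trained decoder: fixed token scores at each step.
STEP_SCORES = [
    {"cheap": 1.0, "pricey": 0.8},
    {"italian": 1.0, "french": 0.9},
    {"food": 5.0},
]

# Original training data: (MR, utterance) pairs, MR as sorted slot/value tuples.
TRAIN = [((("food", "italian"), ("price", "cheap")), "cheap italian food")]


def noisy_greedy_decode(step_scores, noise_std, rng):
    """Greedy decoding with Gaussian noise added to every score (assumption)."""
    tokens = []
    for scores in step_scores:
        noisy = {tok: s + rng.gauss(0.0, noise_std) for tok, s in scores.items()}
        tokens.append(max(noisy, key=noisy.get))
    return tokens


def parse_mr(tokens):
    """Hypothetical rule-based parser: recover the MR from keywords."""
    mr = {}
    for tok in tokens:
        if tok in KEYWORDS:
            slot, value = KEYWORDS[tok]
            if slot in mr and mr[slot] != value:
                return None  # contradictory sample, discard it
            mr[slot] = value
    return tuple(sorted(mr.items())) if mr else None


def augment(train_pairs, step_scores, noise_std, n_samples, seed=0):
    """Add novel (MR, utterance) pairs sampled with noise to the training data."""
    rng = random.Random(seed)
    augmented = list(train_pairs)
    seen = set(train_pairs)
    for _ in range(n_samples):
        tokens = noisy_greedy_decode(step_scores, noise_std, rng)
        mr = parse_mr(tokens)
        pair = (mr, " ".join(tokens))
        if mr is not None and pair not in seen:
            seen.add(pair)
            augmented.append(pair)
    return augmented


for mr, text in augment(TRAIN, STEP_SCORES, noise_std=1.0, n_samples=50):
    print(dict(mr), "->", text)
```

With `noise_std=0.0` the decoder always emits the utterance already in `TRAIN`, so nothing new is added; a nonzero noise level is what lets greedy decoding reach utterances for MRs outside the original data. In a self-training setup, the model would then be retrained on the augmented pairs.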

Code & Models

Repositories

kedz/noiseylg (PyTorch, official)