Few-Shot NLG with Pre-Trained Language Model

Zhiyu Chen; Harini Eavani; Wenhu Chen; Yinyin Liu; William Yang Wang

arXiv:1904.09521·cs.CL·April 21, 2020·6 cites

TL;DR

This paper introduces the task of few-shot natural language generation from structured data and an approach built on a pre-trained language model. With just 200 training examples across multiple domains, it outperforms the strongest baseline by an average of over 8.0 BLEU points.

Contribution

The paper proposes a few-shot NLG method that pairs content selection from the input data with a pre-trained language model, which supplies the ability to compose coherent sentences, so that only minimal training data is needed.

Findings

01

Improves on the strongest baseline by an average of over 8.0 BLEU points with just 200 training examples

02

Demonstrates strong cross-domain generalization

03

Outperforms existing baselines significantly

Abstract

Neural-based end-to-end approaches to natural language generation (NLG) from structured data or knowledge are data-hungry, making their adoption for real-world applications difficult with limited data. In this work, we propose the new task of few-shot natural language generation. Motivated by how humans tend to summarize tabular data, we propose a simple yet effective approach and show that it not only demonstrates strong performance but also provides good generalization across domains. The design of the model architecture is based on two aspects: content selection from input data and language modeling to compose coherent sentences, which can be acquired from prior knowledge. With just 200 training examples, across multiple domains, we show that our approach achieves very reasonable performances and outperforms the strongest baseline by an average of over 8.0 BLEU points…
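
The abstract names two components, content selection and language modeling, without spelling out how they are combined. One common way to combine them is a copy switch in the style of pointer-generator models: at each step, a probability `p_copy` decides how much mass goes to copying a token from the input table versus generating one from the language model's vocabulary. The sketch below illustrates that idea only; the function name `mix_copy_and_generate`, its parameters, and all numbers are hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def mix_copy_and_generate(
    p_copy: float,
    vocab_probs: Dict[str, float],
    attention: List[float],
    table_tokens: List[str],
) -> Dict[str, float]:
    """Blend a language-model distribution with a copy distribution over table tokens.

    Hypothetical helper: an illustration of a copy switch, not the authors' code.
    """
    if not 0.0 <= p_copy <= 1.0:
        raise ValueError("p_copy must be in [0, 1]")
    if len(attention) != len(table_tokens):
        raise ValueError("attention and table_tokens must have the same length")

    # Language-modeling part: scale the vocabulary distribution by (1 - p_copy).
    mixed = {tok: (1.0 - p_copy) * p for tok, p in vocab_probs.items()}

    # Content-selection part: add p_copy-weighted attention mass per table token.
    for tok, weight in zip(table_tokens, attention):
        mixed[tok] = mixed.get(tok, 0.0) + p_copy * weight
    return mixed


# Toy inputs with made-up numbers (assumptions for illustration only).
vocab_probs = {"was": 0.5, "born": 0.3, "in": 0.2}  # language-model distribution
table_tokens = ["Walter", "1907"]                   # values from the input table
attention = [0.75, 0.25]                            # attention over table tokens

mixed = mix_copy_and_generate(0.5, vocab_probs, attention, table_tokens)
next_token = max(mixed, key=mixed.get)
```

In this toy setting the blended distribution still sums to one, and the table value with the most attention becomes the most likely next token. In a few-shot setup along these lines, the vocabulary distribution would come from the pre-trained language model, so mainly the switch and the attention would need to be learned from the small training set.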

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications