KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation
Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang

TL;DR
KGPT introduces a knowledge-grounded pre-training approach for data-to-text generation, enabling effective transfer learning across tasks and domains with minimal labeled data, achieving state-of-the-art results especially in zero-shot and few-shot scenarios.
Contribution
The paper presents a novel pre-training paradigm leveraging web-crawled knowledge-grounded text to improve data-to-text generation, especially in low-resource settings.
Findings
Achieves over 30 ROUGE-L in zero-shot WebNLG
Requires only one-fifteenth of labeled data in few-shot settings
Demonstrates strong generalization across tasks and domains
Abstract
Data-to-text generation has recently attracted substantial interests due to its wide applications. Existing methods have shown impressive performance on an array of tasks. However, they rely on a significant amount of labeled data for each task, which is costly to acquire and thus limits their application to new tasks and domains. In this paper, we propose to leverage pre-training and transfer learning to address this issue. We propose a knowledge-grounded pre-training (KGPT), which consists of two parts, 1) a general knowledge-grounded generation model to generate knowledge-enriched text. 2) a pre-training paradigm on a massive knowledge-grounded text corpus crawled from the web. The pre-trained model can be fine-tuned on various data-to-text generation tasks to generate task-specific text. We adopt three settings, namely fully-supervised, zero-shot, few-shot to evaluate its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
