ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation
Dongling Xiao, Han Zhang, Yukun Li, Yu Sun, Hao Tian, Hua Wu, and Haifeng Wang

TL;DR
ERNIE-GEN introduces a multi-flow pre-training framework for natural language generation that reduces exposure bias and improves performance across various tasks with less data and fewer parameters.
Contribution
It proposes a span-by-span generation flow and multi-granularity target sampling to make pre-training more effective and to narrow the gap between training and inference.
Findings
Achieves state-of-the-art results on multiple language generation tasks.
Uses less pre-training data and fewer parameters than existing methods.
Demonstrates improved generation quality closer to human writing.
Abstract
Current pre-training works in natural language generation pay little attention to the problem of exposure bias on downstream tasks. To address this issue, we propose an enhanced multi-flow sequence to sequence pre-training and fine-tuning framework named ERNIE-GEN, which bridges the discrepancy between training and inference with an infilling generation mechanism and a noise-aware generation method. To make generation closer to human writing patterns, this framework introduces a span-by-span generation flow that trains the model to predict semantically-complete spans consecutively rather than predicting word by word. Unlike existing pre-training methods, ERNIE-GEN incorporates multi-granularity target sampling to construct pre-training data, which enhances the correlation between encoder and decoder. Experimental results demonstrate that ERNIE-GEN achieves state-of-the-art results with…
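The abstract names a noise-aware generation method but does not spell out its mechanics here. One plausible reading is that the ground-truth target is lightly corrupted during training, so the decoder learns to continue from imperfect history, as it must at inference time. The following is a minimal sketch under that assumption, not the authors' implementation: the function name `corrupt_target`, the `noise_rate` value, and the toy vocabulary are all hypothetical.

```python
import random


def corrupt_target(tokens, vocab, noise_rate=0.05, rng=None):
    """Return a copy of `tokens` where each token is swapped for a random
    vocabulary entry with probability `noise_rate`.

    Assumption: this mimics training-time target corruption; the rate and
    the uniform replacement strategy are illustrative choices only.
    """
    rng = rng or random.Random()
    noisy = []
    for tok in tokens:
        if rng.random() < noise_rate:
            # Replace the gold token with an arbitrary vocabulary token.
            noisy.append(rng.choice(vocab))
        else:
            # Keep the gold token unchanged.
            noisy.append(tok)
    return noisy


if __name__ == "__main__":
    toy_vocab = ["the", "a", "cat", "dog", "sat", "ran"]  # hypothetical vocabulary
    target = ["the", "cat", "sat"]
    # The decoder would be fed the corrupted history while the loss is
    # still computed against the clean `target`.
    print(corrupt_target(target, toy_vocab, noise_rate=0.3, rng=random.Random(0)))
```

In such a setup the corrupted sequence serves only as decoder input; the prediction targets stay clean, which is what lets the model learn to recover from earlier errors.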
Code & Models
- MindCode-4/code-10/tree/main/FDDE/train-fine (mindspore)
- https-github-com-GiangHoang9912/ernie-gen (paddle)
- Sharpiless/Versailles-text-generation-with-paddlepaddle (paddle)
- PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/transformers/ernie_gen/modeling.py (paddle)
- MindSpore-scientific/code-12/tree/main/FDDE/train-fine (mindspore)
Taxonomy
Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
Methods: ERNIE-GEN
