MVP: Multi-task Supervised Pre-training for Natural Language Generation
Tianyi Tang, Junyi Li, Wayne Xin Zhao, Ji-Rong Wen

TL;DR
This paper introduces MVP, a multi-task supervised pre-training approach for natural language generation that unifies diverse datasets into a text-to-text format, leading to state-of-the-art results on multiple NLG tasks.
Contribution
The paper presents a large-scale supervised pre-training method using a unified dataset and soft prompts, improving NLG performance over existing models.
Findings
Achieves state-of-the-art results on 13 out of 17 datasets.
Outperforms BART by 9.3% and Flan-T5 by 5.8%.
Demonstrates the effectiveness of supervised multi-task pre-training.
Abstract
Pre-trained language models (PLMs) have achieved remarkable success in natural language generation (NLG) tasks. Up to now, most NLG-oriented PLMs are pre-trained in an unsupervised manner using the large-scale general corpus. In the meanwhile, an increasing number of models pre-trained with labeled data (i.e. "supervised pre-training") showcase superior performance compared to unsupervised pre-trained models. Motivated by the success of supervised pre-training, we propose Multi-task superVised Pre-training (MVP) for natural language generation. We collect a large-scale natural language generation corpus, MVPCorpus, from datasets over diverse NLG tasks. Then we unify these examples into a general text-to-text format to pre-train the text generation model MVP in a supervised manner. For each task, we further pre-train specific soft prompts to stimulate the model's capacity to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗RUCAIBox/mvpmodel· 2.0k dl· ♡ 72.0k dl♡ 7
- 🤗RUCAIBox/mvp-summarizationmodel· 13 dl13 dl
- 🤗RUCAIBox/mvp-data-to-textmodel· 14 dl· ♡ 414 dl♡ 4
- 🤗RUCAIBox/mvp-open-dialogmodel· 31 dl· ♡ 131 dl♡ 1
- 🤗RUCAIBox/mvp-task-dialogmodel· 8 dl· ♡ 38 dl♡ 3
- 🤗RUCAIBox/mvp-question-generationmodel· 11 dl· ♡ 111 dl♡ 1
- 🤗RUCAIBox/mvp-question-answeringmodel· 13 dl· ♡ 213 dl♡ 2
- 🤗RUCAIBox/mvp-storymodel· 12 dl· ♡ 312 dl♡ 3
- 🤗RUCAIBox/mtl-storymodel· 7 dl· ♡ 17 dl♡ 1
- 🤗RUCAIBox/mtl-question-answeringmodel· 9 dl· ♡ 19 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Dense Connections · Softmax · Layer Normalization · Byte Pair Encoding · Dropout · Linear Layer · Residual Connection
