reStructured Pre-training
Weizhe Yuan, Pengfei Liu

TL;DR
This paper introduces reStructured Pre-training (RST), a new NLP paradigm emphasizing data storage and access, leading to models that outperform existing methods on diverse NLP tasks and standardized exams.
Contribution
The paper proposes the RST paradigm, operationalizes data restructuring for pre-training, and demonstrates significant performance improvements across multiple NLP benchmarks and exams.
Findings
RST models outperform strong competitors on 52/55 NLP datasets.
Qin achieves 40 points higher than average students in Gaokao-English.
Qin surpasses GPT-3 in recent English exam scores.
Abstract
In this work, we try to decipher the internal connection of NLP technology development in the past decades, searching for essence, which rewards us with a (potential) new learning paradigm for NLP tasks, dubbed as reStructured Pre-training (RST). In such a paradigm, the role of data will be re-emphasized, and model pre-training and fine-tuning of downstream tasks are viewed as a process of data storing and accessing. Based on that, we operationalize the simple principle that a good storage mechanism should not only have the ability to cache a large amount of data but also consider the ease of access. We achieve this by pre-training models over restructured data that consist of a variety of valuable information instead of raw data after overcoming several engineering challenges. Experimentally, RST models not only surpass strong competitors (e.g., T0) on 52/55 popular datasets from a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗GAIR/rst-fact-retrieval-11bmodel· 3 dl· ♡ 63 dl♡ 6
- 🤗GAIR/rst-information-extraction-11bmodel· 3 dl· ♡ 103 dl♡ 10
- 🤗GAIR/rst-intent-detection-11bmodel· 3 dl· ♡ 63 dl♡ 6
- 🤗GAIR/rst-natural-language-inference-11bmodel· 2 dl· ♡ 22 dl♡ 2
- 🤗GAIR/rst-sentiment-classification-11bmodel· 2 dl· ♡ 22 dl♡ 2
- 🤗GAIR/rst-summarization-11bmodel· 5 dl· ♡ 25 dl♡ 2
- 🤗GAIR/rst-temporal-reasoning-11bmodel· 5 dl· ♡ 25 dl♡ 2
- 🤗GAIR/rst-topic-classification-11bmodel· 6 dl· ♡ 46 dl♡ 4
- 🤗GAIR/rst-word-sense-disambiguation-11bmodel· 4 dl· ♡ 54 dl♡ 5
- 🤗GAIR/rst-all-11bmodel· 6 dl· ♡ 116 dl♡ 11
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsTest
