reStructured Pre-training

Weizhe Yuan; Pengfei Liu

arXiv:2206.11147 · cs.CL · September 9, 2022 · 5 citations

Open Access · 2 Repos · 10 Models

TL;DR

This paper introduces reStructured Pre-training (RST), a new NLP paradigm emphasizing data storage and access, leading to models that outperform existing methods on diverse NLP tasks and standardized exams.

Contribution

The paper proposes the RST paradigm, operationalizes data restructuring for pre-training, and demonstrates significant performance improvements across multiple NLP benchmarks and exams.
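As a rough illustration of what "data restructuring for pre-training" could look like in practice, the sketch below turns mined signals into (source, target) text pairs that a text-to-text model could be trained on. This is a minimal sketch under stated assumptions, not the authors' pipeline: the `Signal` schema, the `TEMPLATES` table, and the `restructure` and `build_corpus` helpers are all hypothetical names introduced here, and the prompt wording is invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Signal:
    """One supervision signal mined from raw data (hypothetical schema)."""
    kind: str    # type of signal, e.g. "summary" or "sentiment"
    text: str    # the source text the signal is attached to
    target: str  # the information the model should learn to retrieve


# Hypothetical prompt templates; the paper's actual prompts are not
# given in this summary, so these are illustrative only.
TEMPLATES = {
    "summary": "TEXT: {text}\nQUERY: Summarize the text.",
    "sentiment": "TEXT: {text}\nQUERY: Is the review positive or negative?",
}


def restructure(signal: Signal) -> tuple[str, str]:
    """Turn one signal into a (source, target) pair for text-to-text training."""
    template = TEMPLATES[signal.kind]
    return template.format(text=signal.text), signal.target


def build_corpus(signals: list[Signal]) -> list[tuple[str, str]]:
    """Restructure every signal that has a template; skip unknown kinds."""
    return [restructure(s) for s in signals if s.kind in TEMPLATES]
```

The design choice worth noting is that every signal type is mapped into one uniform format, so the same model can both "store" the information during pre-training and "access" it later through a similarly shaped query.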

Findings

01 RST models outperform strong competitors (e.g., T0) on 52 of 55 NLP datasets.

02 Qin, the proposed system, scores 40 points higher than the average student on Gaokao-English.

03 Qin also outscores GPT-3 on recent English exams.

Abstract

In this work, we try to decipher the internal connection of NLP technology development in the past decades, searching for essence, which rewards us with a (potential) new learning paradigm for NLP tasks, dubbed as reStructured Pre-training (RST). In such a paradigm, the role of data will be re-emphasized, and model pre-training and fine-tuning of downstream tasks are viewed as a process of data storing and accessing. Based on that, we operationalize the simple principle that a good storage mechanism should not only have the ability to cache a large amount of data but also consider the ease of access. We achieve this by pre-training models over restructured data that consist of a variety of valuable information instead of raw data after overcoming several engineering challenges. Experimentally, RST models not only surpass strong competitors (e.g., T0) on 52/55 popular datasets from a…

Taxonomy

Topics: Topic Modeling

Methods: Test