Recent Advances in Natural Language Processing via Large Pre-Trained   Language Models: A Survey

Bonan Min; Hayley Ross; Elior Sulem; Amir Pouran Ben Veyseh; Thien Huu; Nguyen; Oscar Sainz; Eneko Agirre; Ilana Heinz; and Dan Roth

arXiv:2111.01243·cs.CL·November 3, 2021·161 cites

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu, Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, and Dan Roth

PDF

Open Access

TL;DR

This survey reviews recent progress in NLP driven by large pre-trained transformer models, covering methods like fine-tuning, prompting, data augmentation, and discussing current limitations and future directions.

Contribution

It provides a comprehensive overview of recent techniques and applications of large pre-trained language models in NLP, highlighting new methods and research trends.

Findings

01

Large pre-trained models significantly improve NLP performance.

02

Various approaches like fine-tuning, prompting, and data augmentation are effective.

03

Identifies current limitations and future research directions.

Abstract

Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field. We present a survey of recent work that uses these large language models to solve NLP tasks via pre-training then fine-tuning, prompting, or text generation approaches. We also present approaches that use pre-trained language models to generate data for training augmentation or other purposes. We conclude with discussions on limitations and suggested directions for future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Residual Connection · Layer Normalization · Softmax · Weight Decay · WordPiece