Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu, Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, and Dan Roth

TL;DR
This survey reviews recent progress in NLP driven by large pre-trained transformer models, covering methods like fine-tuning, prompting, data augmentation, and discussing current limitations and future directions.
Contribution
It provides a comprehensive overview of recent techniques and applications of large pre-trained language models in NLP, highlighting new methods and research trends.
Findings
Large pre-trained models significantly improve NLP performance.
Various approaches like fine-tuning, prompting, and data augmentation are effective.
Identifies current limitations and future research directions.
Abstract
Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field. We present a survey of recent work that uses these large language models to solve NLP tasks via pre-training then fine-tuning, prompting, or text generation approaches. We also present approaches that use pre-trained language models to generate data for training augmentation or other purposes. We conclude with discussions on limitations and suggested directions for future research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Residual Connection · Layer Normalization · Softmax · Weight Decay · WordPiece
