A critical review of methods and challenges in large language models
Milad Moradi, Ke Yan, David Colwell, Matthias Samwald, Rhona Asgari

TL;DR
This review critically analyzes the evolution, techniques, and challenges of large language models, emphasizing advancements, alignment methods, and ethical considerations to guide future research and responsible deployment.
Contribution
It provides a comprehensive overview of LLM architectures, training methods, alignment strategies, and ethical issues, highlighting current gaps and future research directions.
Findings
Transformers have significantly advanced LLM capabilities.
In-context learning and fine-tuning improve model efficiency.
Alignment with human preferences remains a key challenge.
Abstract
This critical review provides an in-depth analysis of Large Language Models (LLMs), encompassing their foundational principles, diverse applications, and advanced training methodologies. We critically examine the evolution from Recurrent Neural Networks (RNNs) to Transformer models, highlighting the significant advancements and innovations in LLM architectures. The review explores state-of-the-art techniques such as in-context learning and various fine-tuning approaches, with an emphasis on optimizing parameter efficiency. We also discuss methods for aligning LLMs with human preferences, including reinforcement learning frameworks and human feedback mechanisms. The emerging technique of retrieval-augmented generation, which integrates external knowledge into LLMs, is also evaluated. Additionally, we address the ethical considerations of deploying LLMs, stressing the importance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
MethodsFocus
