A critical review of methods and challenges in large language models

Milad Moradi; Ke Yan; David Colwell; Matthias Samwald; Rhona Asgari

arXiv:2404.11973·cs.AI·September 29, 2025·5 cites

A critical review of methods and challenges in large language models

Milad Moradi, Ke Yan, David Colwell, Matthias Samwald, Rhona Asgari

PDF

Open Access

TL;DR

This review critically analyzes the evolution, techniques, and challenges of large language models, emphasizing advancements, alignment methods, and ethical considerations to guide future research and responsible deployment.

Contribution

It provides a comprehensive overview of LLM architectures, training methods, alignment strategies, and ethical issues, highlighting current gaps and future research directions.

Findings

01

Transformers have significantly advanced LLM capabilities.

02

In-context learning and fine-tuning improve model efficiency.

03

Alignment with human preferences remains a key challenge.

Abstract

This critical review provides an in-depth analysis of Large Language Models (LLMs), encompassing their foundational principles, diverse applications, and advanced training methodologies. We critically examine the evolution from Recurrent Neural Networks (RNNs) to Transformer models, highlighting the significant advancements and innovations in LLM architectures. The review explores state-of-the-art techniques such as in-context learning and various fine-tuning approaches, with an emphasis on optimizing parameter efficiency. We also discuss methods for aligning LLMs with human preferences, including reinforcement learning frameworks and human feedback mechanisms. The emerging technique of retrieval-augmented generation, which integrates external knowledge into LLMs, is also evaluated. Additionally, we address the ethical considerations of deploying LLMs, stressing the importance of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsFocus