Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

Xudong Han, Junjie Yang, Tianyang Wang, Ziqian Bi, Xinyuan Song, Junfeng Hao, and Junhao Song

arXiv:2508.17184 · cs.CL · November 20, 2025

TL;DR

This survey reviews instruction tuning techniques for large language models, covering data collection, fine-tuning methods, evaluation challenges, and future directions to improve alignment with human goals.

Contribution

The survey categorizes data construction paradigms, fine-tuning strategies, and evaluation protocols, and highlights recent advances and future research directions in instruction tuning.

Findings

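01. The three data construction paradigms (expert annotation, distillation from larger models, and self-improvement) trade off quality, scalability, and resource cost.

02. Lightweight fine-tuning methods such as LoRA and prefix tuning improve computational efficiency and model reusability.

03. Evaluating faithfulness, utility, and safety remains challenging.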

Abstract

Instruction tuning is a pivotal technique for aligning large language models (LLMs) with human intentions, safety constraints, and domain-specific requirements. This survey provides a comprehensive overview of the full pipeline, encompassing (i) data collection methodologies, (ii) full-parameter and parameter-efficient fine-tuning strategies, and (iii) evaluation protocols. We categorize data construction into three major paradigms: expert annotation, distillation from larger models, and self-improvement mechanisms, each offering distinct trade-offs between quality, scalability, and resource cost. Fine-tuning techniques range from conventional supervised training to lightweight approaches such as low-rank adaptation (LoRA) and prefix tuning, with a focus on computational efficiency and model reusability. We further examine the challenges of evaluating faithfulness, utility, and safety…
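To make the efficiency argument for low-rank adaptation concrete, the following is a minimal illustrative sketch and not code from the survey. It assumes the commonly described LoRA formulation, in which a frozen weight matrix W is augmented with a trainable low-rank product scaled by alpha / r. The class name `LoRALinear`, the helper `matvec`, the layer sizes, and the initialization constants are all hypothetical choices made for illustration.

```python
import random


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


class LoRALinear:
    """Frozen weight W plus a trainable low-rank update (alpha / r) * B @ A.

    Hypothetical illustration: names and init constants are assumptions.
    """

    def __init__(self, d_in, d_out, r, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # W stands in for a pretrained weight and is never updated.
        self.W = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # A (r x d_in) starts random; B (d_out x r) starts at zero, so the
        # adapted layer initially behaves exactly like the frozen layer.
        self.A = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]
        self.B = [[0.0] * r for _ in range(d_out)]
        self.scale = alpha / r

    def forward(self, x):
        base = matvec(self.W, x)                    # frozen path
        delta = matvec(self.B, matvec(self.A, x))   # low-rank path
        return [b + self.scale * d for b, d in zip(base, delta)]

    def trainable_params(self):
        # Only A and B would receive gradient updates.
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])

    def full_params(self):
        # Parameter count if W itself were fine-tuned.
        return len(self.W) * len(self.W[0])


layer = LoRALinear(d_in=64, d_out=64, r=4)
x = [1.0] * 64
print(layer.trainable_params(), "trainable vs", layer.full_params(), "full")
```

In this toy configuration the adapter trains r * (d_in + d_out) values instead of d_in * d_out, and because B starts at zero the adapted layer reproduces the frozen layer's output before any training. Since W is left untouched, the same base weights could in principle be shared across several such adapters, which is one way to read the abstract's point about model reusability.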
