Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models
Xudong Han, Junjie Yang, Tianyang Wang, Ziqian Bi, Xinyuan Song, Junfeng Hao, and Junhao Song

TL;DR
This survey reviews instruction tuning techniques for large language models, covering data collection, fine-tuning methods, evaluation challenges, and future directions to improve alignment with human goals.
Contribution
It provides a comprehensive categorization of data paradigms, fine-tuning strategies, and evaluation protocols, highlighting recent advances and future research directions in instruction tuning.
Findings
Data construction paradigms vary in quality and scalability.
Lightweight fine-tuning methods like LoRA improve efficiency.
Evaluation of faithfulness and safety remains challenging.
Abstract
Instruction tuning is a pivotal technique for aligning large language models (LLMs) with human intentions, safety constraints, and domain-specific requirements. This survey provides a comprehensive overview of the full pipeline, encompassing (i) data collection methodologies, (ii) full-parameter and parameter-efficient fine-tuning strategies, and (iii) evaluation protocols. We categorized data construction into three major paradigms: expert annotation, distillation from larger models, and self-improvement mechanisms, each offering distinct trade-offs between quality, scalability, and resource cost. Fine-tuning techniques range from conventional supervised training to lightweight approaches, such as low-rank adaptation (LoRA) and prefix tuning, with a focus on computational efficiency and model reusability. We further examine the challenges of evaluating faithfulness, utility, and safety…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
