Aligning Large Language Models with Human: A Survey
Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong, Huang, Lifeng Shang, Xin Jiang, Qun Liu

TL;DR
This survey reviews methods for aligning large language models with human expectations, covering data collection, training techniques, and evaluation methods, highlighting current challenges and future research directions.
Contribution
It provides a comprehensive overview of LLM alignment technologies, including data collection, training methodologies, and evaluation strategies, serving as a valuable resource for future research.
Findings
Effective data collection methods for LLM alignment
Diverse training methodologies including supervised and preference-based approaches
Multifaceted evaluation strategies for aligned LLMs
Abstract
Large Language Models (LLMs) trained on extensive textual corpora have emerged as leading solutions for a broad array of Natural Language Processing (NLP) tasks. Despite their notable performance, these models are prone to certain limitations such as misunderstanding human instructions, generating potentially biased content, or factually incorrect (hallucinated) information. Hence, aligning LLMs with human expectations has become an active area of interest within the research community. This survey presents a comprehensive overview of these alignment technologies, including the following aspects. (1) Data collection: the methods for effectively collecting high-quality instructions for LLM alignment, including the use of NLP benchmarks, human annotations, and leveraging strong LLMs. (2) Training methodologies: a detailed review of the prevailing training methods employed for LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Materials Science
