Aligning Large Language Models with Human: A Survey

Yufei Wang; Wanjun Zhong; Liangyou Li; Fei Mi; Xingshan Zeng; Wenyong; Huang; Lifeng Shang; Xin Jiang; Qun Liu

arXiv:2307.12966·cs.CL·July 25, 2023·54 cites

Aligning Large Language Models with Human: A Survey

Yufei Wang, Wanjun Zhong, Liangyou Li, Fei Mi, Xingshan Zeng, Wenyong, Huang, Lifeng Shang, Xin Jiang, Qun Liu

PDF

Open Access 1 Repo

TL;DR

This survey reviews methods for aligning large language models with human expectations, covering data collection, training techniques, and evaluation methods, highlighting current challenges and future research directions.

Contribution

It provides a comprehensive overview of LLM alignment technologies, including data collection, training methodologies, and evaluation strategies, serving as a valuable resource for future research.

Findings

01

Effective data collection methods for LLM alignment

02

Diverse training methodologies including supervised and preference-based approaches

03

Multifaceted evaluation strategies for aligned LLMs

Abstract

Large Language Models (LLMs) trained on extensive textual corpora have emerged as leading solutions for a broad array of Natural Language Processing (NLP) tasks. Despite their notable performance, these models are prone to certain limitations such as misunderstanding human instructions, generating potentially biased content, or factually incorrect (hallucinated) information. Hence, aligning LLMs with human expectations has become an active area of interest within the research community. This survey presents a comprehensive overview of these alignment technologies, including the following aspects. (1) Data collection: the methods for effectively collecting high-quality instructions for LLM alignment, including the use of NLP benchmarks, human annotations, and leveraging strong LLMs. (2) Training methodologies: a detailed review of the prevailing training methods employed for LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

garyyufei/alignllmhumansurvey
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Materials Science