Progressively Label Enhancement for Large Language Model Alignment

Biao Liu; Ning Xu; Xin Geng

arXiv:2408.02599·cs.CL·October 10, 2024

Progressively Label Enhancement for Large Language Model Alignment

Biao Liu, Ning Xu, Xin Geng

PDF

Open Access

TL;DR

This paper introduces PLE, a dynamic framework for LLM alignment that improves training efficiency by progressively adjusting data labeling based on response quality, outperforming existing methods.

Contribution

The paper presents a novel progressive label enhancement framework that dynamically guides LLM training using evolving data quality, addressing limitations of static data generation methods.

Findings

01

PLE outperforms existing LLM alignment methods in experiments.

02

Dynamic thresholding improves response quality during training.

03

The approach enhances data utilization efficiency.

Abstract

Large Language Models (LLM) alignment aims to prevent models from producing content that misaligns with human expectations, which can lead to ethical and legal concerns. In the last few years, Reinforcement Learning from Human Feedback (RLHF) has been the most prominent method for achieving alignment. Due to challenges in stability and scalability with RLHF stages, which arise from the complex interactions between multiple models, researchers are exploring alternative methods to achieve effects comparable to those of RLHF. However, these methods often rely on large high-quality datasets. Despite some methods considering the generation of additional data to expand datasets, they often treat model training and data generation as separate and static processes, overlooking the fact that these processes are highly interdependent, leading to inefficient utilization of the generated data. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis

MethodsSparse Evolutionary Training · ALIGN