Progressively Label Enhancement for Large Language Model Alignment
Biao Liu, Ning Xu, Xin Geng

TL;DR
This paper introduces PLE, a dynamic framework for LLM alignment that improves training efficiency by progressively adjusting data labeling based on response quality, outperforming existing methods.
Contribution
The paper presents a novel progressive label enhancement framework that dynamically guides LLM training using evolving data quality, addressing limitations of static data generation methods.
Findings
PLE outperforms existing LLM alignment methods in experiments.
Dynamic thresholding improves response quality during training.
The approach enhances data utilization efficiency.
Abstract
Large Language Models (LLM) alignment aims to prevent models from producing content that misaligns with human expectations, which can lead to ethical and legal concerns. In the last few years, Reinforcement Learning from Human Feedback (RLHF) has been the most prominent method for achieving alignment. Due to challenges in stability and scalability with RLHF stages, which arise from the complex interactions between multiple models, researchers are exploring alternative methods to achieve effects comparable to those of RLHF. However, these methods often rely on large high-quality datasets. Despite some methods considering the generation of additional data to expand datasets, they often treat model training and data generation as separate and static processes, overlooking the fact that these processes are highly interdependent, leading to inefficient utilization of the generated data. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
MethodsSparse Evolutionary Training · ALIGN
