Uncertainty Aware Learning for Language Model Alignment

Yikun Wang; Rui Zheng; Liang Ding; Qi Zhang; Dahua Lin; Dacheng Tao

arXiv:2406.04854·cs.CL·June 10, 2024·1 cites

Uncertainty Aware Learning for Language Model Alignment

Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, Dacheng Tao

PDF

Open Access 1 Video

TL;DR

This paper introduces uncertainty-aware learning (UAL) for language model alignment, adaptively adjusting training based on sample uncertainty to improve performance across diverse tasks.

Contribution

The paper proposes a novel UAL method that incorporates sample uncertainty into training, enhancing alignment and efficiency of large language models.

Findings

01

UAL improves token clustering in feature space.

02

Significant performance gains on benchmarks, e.g., 10.62% on high-entropy tasks.

03

Consistent outperformance over standard fine-tuning methods.

Abstract

As instruction-tuned large language models (LLMs) evolve, aligning pretrained foundation models presents increasing challenges. Existing alignment strategies, which typically leverage diverse and high-quality data sources, often overlook the intrinsic uncertainty of tasks, learning all data samples equally. This may lead to suboptimal data efficiency and model performance. In response, we propose uncertainty-aware learning (UAL) to improve the model alignment of different task scenarios, by introducing the sample uncertainty (elicited from more capable LLMs). We implement UAL in a simple fashion -- adaptively setting the label smoothing value of training according to the uncertainty of individual samples. Analysis shows that our UAL indeed facilitates better token clustering in the feature space, validating our hypothesis. Extensive experiments on widely used benchmarks demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Uncertainty Aware Learning for Language Model Alignment· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsLabel Smoothing