Improving Instruction Following in Language Models through Proxy-Based   Uncertainty Estimation

JoonHo Lee; Jae Oh Woo; Juree Seok; Parisa Hassanzadeh; Wooseok Jang,; JuYoun Son; Sima Didari; Baruch Gutow; Heng Hao; Hankyu Moon; Wenjun Hu,; Yeong-Dae Kwon; Taehee Lee; Seungjai Min

arXiv:2405.06424·cs.CL·February 3, 2025

Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation

JoonHo Lee, Jae Oh Woo, Juree Seok, Parisa Hassanzadeh, Wooseok Jang,, JuYoun Son, Sima Didari, Baruch Gutow, Heng Hao, Hankyu Moon, Wenjun Hu,, Yeong-Dae Kwon, Taehee Lee, Seungjai Min

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Bayesian uncertainty estimation method for assessing response quality in language models, significantly improving instruction following by refining training data and policy optimization.

Contribution

We propose a novel Uncertainty-aware Reward Model that estimates response quality and uncertainty, enhancing language model training and performance on benchmarks.

Findings

01

Boosts instruction following capabilities of language models.

02

Outperforms existing methods on Vicuna and MT-bench benchmarks.

03

Enhances data curation and policy optimization through uncertainty estimation.

Abstract

Assessing response quality to instructions in language models is vital but challenging due to the complexity of human language across different contexts. This complexity often results in ambiguous or inconsistent interpretations, making accurate assessment difficult. To address this issue, we propose a novel Uncertainty-aware Reward Model (URM) that introduces a robust uncertainty estimation for the quality of paired responses based on Bayesian approximation. Trained with preference datasets, our uncertainty-enabled proxy not only scores rewards for responses but also evaluates their inherent uncertainty. Empirical results demonstrate significant benefits of incorporating the proposed proxy into language model training. Our method boosts the instruction following capability of language models by refining data curation for training and improving policy optimization objectives, thereby…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

p-b-u/proxy_based_uncertainty
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications