Deep Bayesian Active Learning for Preference Modeling in Large Language   Models

Luckeciano C. Melo; Panagiotis Tigas; Alessandro Abate; Yarin Gal

arXiv:2406.10023·cs.LG·October 30, 2024·1 cites

Deep Bayesian Active Learning for Preference Modeling in Large Language Models

Luckeciano C. Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces BAL-PM, a Bayesian active learning method that efficiently selects informative data points for preference modeling in large language models, significantly reducing labeling costs.

Contribution

The paper proposes a novel stochastic acquisition policy, BAL-PM, that improves preference data selection by balancing uncertainty and entropy, outperforming previous methods.

Findings

01

BAL-PM reduces preference label requirements by 33% to 68%.

02

It outperforms previous stochastic Bayesian acquisition policies.

03

Demonstrates effectiveness on popular human preference datasets.

Abstract

Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at large scale. Hence, selecting the most informative points for acquiring human feedback may considerably reduce the cost of preference labeling and unleash the further development of LLMs. Bayesian Active Learning provides a principled framework for addressing this challenge and has demonstrated remarkable success in diverse settings. However, previous attempts to employ it for Preference Modeling did not meet such expectations. In this work, we identify that naive epistemic uncertainty estimation leads to the acquisition of redundant samples. We address this by proposing the Bayesian Active Learner for Preference Modeling (BAL-PM), a novel stochastic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luckeciano/bal-pm
pytorchOfficial

Videos

Deep Bayesian Active Learning for Preference Modeling in Large Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms