Preference Consistency Matters: Enhancing Preference Learning in   Language Models with Automated Self-Curation of Training Corpora

JoonHo Lee; JuYoun Son; Juree Seok; Wooseok Jang; Yeong-Dae Kwon

arXiv:2408.12799·cs.CL·February 3, 2025

Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora

JoonHo Lee, JuYoun Son, Juree Seok, Wooseok Jang, Yeong-Dae Kwon

PDF

Open Access 1 Video

TL;DR

This paper introduces a self-curation method that improves preference learning in language models by automatically identifying and selecting consistent annotations, leading to significant performance gains.

Contribution

It presents a novel automated self-curation approach that addresses annotation inconsistencies in preference datasets, enhancing preference learning without heuristics.

Findings

01

Performance improvements of up to 33% in instruction-following tasks

02

Effective detection and selection of consistent annotations

03

Applicable across various learning algorithms and proxy models

Abstract

Inconsistent annotations in training corpora, particularly within preference learning datasets, pose challenges in developing advanced language models. These inconsistencies often arise from variability among annotators and inherent multi-dimensional nature of the preferences. To address these issues, we introduce a self-curation method that preprocesses annotated datasets by leveraging proxy models trained directly on them. Our method enhances preference learning by automatically detecting and selecting consistent annotations. We validate the proposed approach through extensive instruction-following tasks, demonstrating performance improvements of up to 33\% across various learning algorithms and proxy capabilities. This work offers a straightforward and reliable solution to address preference inconsistencies without relying on heuristics, serving as an initial step toward the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems