A Systematic Examination of Preference Learning through the Lens of   Instruction-Following

Joongwon Kim; Anirudh Goyal; Aston Zhang; Bo Xiong; Rui Hou; Melanie; Kambadur; Dhruv Mahajan; Hannaneh Hajishirzi; Liang Tan

arXiv:2412.15282·cs.CL·December 23, 2024

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Joongwon Kim, Anirudh Goyal, Aston Zhang, Bo Xiong, Rui Hou, Melanie, Kambadur, Dhruv Mahajan, Hannaneh Hajishirzi, Liang Tan

PDF

Open Access 1 Video

TL;DR

This paper systematically studies how different attributes of preference datasets influence the alignment and performance of large language models in instruction-following tasks, using synthetic data and novel curation methods.

Contribution

It introduces a synthetic data generation pipeline and evaluates the effects of dataset attributes on LLM alignment, providing insights for optimizing preference data curation.

Findings

01

Shared prefixes in preference pairs improve stability and performance.

02

High-contrast preference pairs enhance learning efficiency.

03

Moderate difficulty prompts lead to better generalization.

Abstract

Preference learning is a widely adopted post-training technique that aligns large language models (LLMs) to human preferences and improves specific downstream task capabilities. In this work we systematically investigate how specific attributes of preference datasets affect the alignment and downstream performance of LLMs in instruction-following tasks. We use a novel synthetic data generation pipeline to generate 48,000 unique instruction-following prompts with combinations of 23 verifiable constraints that enable fine-grained and automated quality assessments of model responses. With our synthetic prompts, we use two preference dataset curation methods - rejection sampling (RS) and Monte Carlo Tree Search (MCTS) - to obtain pairs of (chosen, rejected) responses. Then, we perform experiments investigating the effects of (1) the presence of shared prefixes between the chosen and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Systematic Examination of Preference Learning through the Lens of Instruction-Following· underline

Taxonomy

TopicsInnovative Teaching and Learning Methods · Education and Learning Interventions · Education, Safety, and Science Studies