A Systematic Examination of Preference Learning through the Lens of Instruction-Following
Joongwon Kim, Anirudh Goyal, Aston Zhang, Bo Xiong, Rui Hou, Melanie, Kambadur, Dhruv Mahajan, Hannaneh Hajishirzi, Liang Tan

TL;DR
This paper systematically studies how different attributes of preference datasets influence the alignment and performance of large language models in instruction-following tasks, using synthetic data and novel curation methods.
Contribution
It introduces a synthetic data generation pipeline and evaluates the effects of dataset attributes on LLM alignment, providing insights for optimizing preference data curation.
Findings
Shared prefixes in preference pairs improve stability and performance.
High-contrast preference pairs enhance learning efficiency.
Moderate difficulty prompts lead to better generalization.
Abstract
Preference learning is a widely adopted post-training technique that aligns large language models (LLMs) to human preferences and improves specific downstream task capabilities. In this work we systematically investigate how specific attributes of preference datasets affect the alignment and downstream performance of LLMs in instruction-following tasks. We use a novel synthetic data generation pipeline to generate 48,000 unique instruction-following prompts with combinations of 23 verifiable constraints that enable fine-grained and automated quality assessments of model responses. With our synthetic prompts, we use two preference dataset curation methods - rejection sampling (RS) and Monte Carlo Tree Search (MCTS) - to obtain pairs of (chosen, rejected) responses. Then, we perform experiments investigating the effects of (1) the presence of shared prefixes between the chosen and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsInnovative Teaching and Learning Methods · Education and Learning Interventions · Education, Safety, and Science Studies
