A Deep Dive into the Trade-Offs of Parameter-Efficient Preference   Alignment Techniques

Megh Thakkar; Quentin Fournier; Matthew D Riemer; Pin-Yu Chen; Amal; Zouaq; Payel Das; Sarath Chandar

arXiv:2406.04879·cs.CL·June 10, 2024

A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques

Megh Thakkar, Quentin Fournier, Matthew D Riemer, Pin-Yu Chen, Amal, Zouaq, Payel Das, Sarath Chandar

PDF

Open Access

TL;DR

This paper systematically investigates how different choices in parameter-efficient preference alignment methods affect large language model performance, providing insights and guidelines for more effective alignment strategies.

Contribution

It offers an extensive empirical study of the effects of datasets, techniques, and models on preference alignment, revealing key trends and unexpected results.

Findings

01

More informative data improves preference alignment.

02

Supervised fine-tuning can outperform preference optimization in some cases.

03

Aligning to specific preferences enhances downstream task performance.

Abstract

Large language models are first pre-trained on trillions of tokens and then instruction-tuned or aligned to specific preferences. While pre-training remains out of reach for most researchers due to the compute required, fine-tuning has become affordable thanks to parameter-efficient methods such as LoRA and QLoRA. Alignment is known to be sensitive to the many factors involved, including the quantity and quality of data, the alignment method, and the adapter rank. However, there has not yet been an extensive study of their effect on downstream performance. To address this gap, we conduct an in-depth investigation of the impact of popular choices for three crucial axes: (i) the alignment dataset (HH-RLHF and BeaverTails), (ii) the alignment technique (SFT and DPO), and (iii) the model (LLaMA-1, Vicuna-v1.3, Mistral-7b, and Mistral-7b-Instruct). Our extensive setup spanning over 300…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Management and Algorithms

MethodsAdapter