Disentangling Preference Representation and Text Generation for   Efficient Individual Preference Alignment

Jianfei Zhang; Jun Bai; Bei Li; Yanmeng Wang; Rumei Li; Chenghua Lin,; Wenge Rong

arXiv:2412.20834·cs.CL·December 31, 2024

Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment

Jianfei Zhang, Jun Bai, Bei Li, Yanmeng Wang, Rumei Li, Chenghua Lin,, Wenge Rong

PDF

Open Access 1 Repo

TL;DR

This paper proposes a new method for efficiently aligning large language models with individual human preferences by disentangling preference representation from text generation, significantly reducing training time.

Contribution

The paper introduces a flexible paradigm that improves alignment efficiency by separating preference encoding from text generation in LLMs, outperforming existing PEFT-based methods.

Findings

01

Achieves comparable or better alignment quality than PEFT-based methods.

02

Reduces additional training time for new preferences by 80-90%.

03

Validated across multiple text generation tasks.

Abstract

Aligning Large Language Models (LLMs) with general human preferences has been proved crucial in improving the interaction quality between LLMs and human. However, human values are inherently diverse among different individuals, making it insufficient to align LLMs solely with general preferences. To address this, personalizing LLMs according to individual feedback emerges as a promising solution. Nonetheless, this approach presents challenges in terms of the efficiency of alignment algorithms. In this work, we introduce a flexible paradigm for individual preference alignment. Our method fundamentally improves efficiency by disentangling preference representation from text generation in LLMs. We validate our approach across multiple text generation tasks and demonstrate that it can produce aligned quality as well as or better than PEFT-based methods, while reducing additional training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhangjf-nlp/LatentDPO
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Data Management and Algorithms · Topic Modeling

MethodsALIGN