MetaAlign: Align Large Language Models with Diverse Preferences during   Inference Time

Mozhi Zhang; Pengyu Wang; Chenkun Tan; Mianqiu Huang; Dong Zhang,; Yaqian Zhou; Xipeng Qiu

arXiv:2410.14184·cs.CL·October 21, 2024

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Mozhi Zhang, Pengyu Wang, Chenkun Tan, Mianqiu Huang, Dong Zhang,, Yaqian Zhou, Xipeng Qiu

PDF

Open Access 1 Repo

TL;DR

MetaAlign enables large language models to dynamically adapt to diverse human preferences during inference, overcoming the static limitations of traditional alignment methods like RLHF and DPO.

Contribution

We introduce MetaAlign, a novel approach allowing LLMs to align with various preferences specified at inference time, enhancing flexibility over existing static alignment techniques.

Findings

01

LLMs trained with MetaAlign effectively adapt to different preferences during inference.

02

MetaAlign outperforms static alignment methods in aligning with diverse preferences.

03

Experimental results validate the feasibility of dynamic preference alignment in LLMs.

Abstract

Large Language Models (LLMs) acquire extensive knowledge and remarkable abilities from extensive text corpora, making them powerful tools for various applications. To make LLMs more usable, aligning them with human preferences is essential. Existing alignment techniques, such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO), typically embed predefined preferences directly within the model's parameters. These methods, however, often result in a static alignment that can not account for the diversity of human preferences in practical applications. In response to this challenge, we propose an effective method, \textbf{MetaAlign}, which aims to help LLMs dynamically align with various explicit or implicit preferences specified at inference time. Experimental results show that LLMs optimized on our meticulously constructed MetaAlign Dataset can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jihuai-wpy/MetaAlign
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsALIGN