Aligning LLMs with Individual Preferences via Interaction

Shujin Wu; May Fung; Cheng Qian; Jeonghwan Kim; Dilek Hakkani-Tur,; Heng Ji

arXiv:2410.03642·cs.CL·December 17, 2024

Aligning LLMs with Individual Preferences via Interaction

Shujin Wu, May Fung, Cheng Qian, Jeonghwan Kim, Dilek Hakkani-Tur,, Heng Ji

PDF

Open Access 1 Repo

TL;DR

This paper presents a method for training large language models to dynamically infer and align with individual user preferences through multi-turn interactions, enhancing personalized user experiences.

Contribution

We introduce a novel approach that enables LLMs to implicitly infer user preferences during conversations and adapt responses accordingly, using a new dataset and benchmark.

Findings

01

Effective dynamic personalized alignment demonstrated

02

Created a diverse user persona dataset with 3,310 personas

03

Established the ALOE benchmark for measuring personalized alignment

Abstract

As large language models (LLMs) demonstrate increasingly advanced capabilities, aligning their behaviors with human values and preferences becomes crucial for their wide adoption. While previous research focuses on general alignment to principles such as helpfulness, harmlessness, and honesty, the need to account for individual and diverse preferences has been largely overlooked, potentially undermining customized human experiences. To address this gap, we train LLMs that can ''interact to align'', essentially cultivating the meta-skill of LLMs to implicitly infer the unspoken personalized preferences of the current user through multi-turn conversations, and then dynamically align their following behaviors and responses to these inferred preferences. Our approach involves establishing a diverse pool of 3,310 distinct user personas by initially creating seed examples, which are then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shujinwu-0814/aloe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Semantic Web and Ontologies · Business Process Modeling and Analysis

MethodsALIGN