Aligning LLMs with Individual Preferences via Interaction
Shujin Wu, May Fung, Cheng Qian, Jeonghwan Kim, Dilek Hakkani-Tur,, Heng Ji

TL;DR
This paper presents a method for training large language models to dynamically infer and align with individual user preferences through multi-turn interactions, enhancing personalized user experiences.
Contribution
We introduce a novel approach that enables LLMs to implicitly infer user preferences during conversations and adapt responses accordingly, using a new dataset and benchmark.
Findings
Effective dynamic personalized alignment demonstrated
Created a diverse user persona dataset with 3,310 personas
Established the ALOE benchmark for measuring personalized alignment
Abstract
As large language models (LLMs) demonstrate increasingly advanced capabilities, aligning their behaviors with human values and preferences becomes crucial for their wide adoption. While previous research focuses on general alignment to principles such as helpfulness, harmlessness, and honesty, the need to account for individual and diverse preferences has been largely overlooked, potentially undermining customized human experiences. To address this gap, we train LLMs that can ''interact to align'', essentially cultivating the meta-skill of LLMs to implicitly infer the unspoken personalized preferences of the current user through multi-turn conversations, and then dynamically align their following behaviors and responses to these inferred preferences. Our approach involves establishing a diverse pool of 3,310 distinct user personas by initially creating seed examples, which are then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Semantic Web and Ontologies · Business Process Modeling and Analysis
MethodsALIGN
