Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes
Zhuocheng Gong, Jian Guan, Wei Wu, Huishuai Zhang, Dongyan Zhao

TL;DR
This paper introduces Latent Preference Coding (LPC), a novel framework that models complex human preferences in large language models using discrete latent codes, improving alignment robustness without pre-defined reward functions.
Contribution
LPC provides a unified, data-driven approach to model multifaceted human preferences, enhancing alignment methods without relying on explicit reward functions or hand-crafted weights.
Findings
LPC improves alignment performance across multiple benchmarks.
Latent codes capture differences in human preference distributions.
LPC enhances robustness of alignment against data noise.
Abstract
Large language models (LLMs) have achieved remarkable success, yet aligning their generations with human preferences remains a critical challenge. Existing approaches to preference modeling often rely on an explicit or implicit reward function, overlooking the intricate and multifaceted nature of human preferences that may encompass conflicting factors across diverse tasks and populations. To address this limitation, we introduce Latent Preference Coding (LPC), a novel framework that models the implicit factors as well as their combinations behind holistic preferences using discrete latent codes. LPC seamlessly integrates with various offline alignment algorithms, automatically inferring the underlying factors and their importance from data without relying on pre-defined reward functions and hand-crafted combination weights. Extensive experiments on multiple benchmarks demonstrate that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
MethodsBalanced Selection
