Latent Preference Coding: Aligning Large Language Models via Discrete   Latent Codes

Zhuocheng Gong; Jian Guan; Wei Wu; Huishuai Zhang; Dongyan Zhao

arXiv:2505.04993·cs.CL·May 9, 2025

Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes

Zhuocheng Gong, Jian Guan, Wei Wu, Huishuai Zhang, Dongyan Zhao

PDF

Open Access

TL;DR

This paper introduces Latent Preference Coding (LPC), a novel framework that models complex human preferences in large language models using discrete latent codes, improving alignment robustness without pre-defined reward functions.

Contribution

LPC provides a unified, data-driven approach to model multifaceted human preferences, enhancing alignment methods without relying on explicit reward functions or hand-crafted weights.

Findings

01

LPC improves alignment performance across multiple benchmarks.

02

Latent codes capture differences in human preference distributions.

03

LPC enhances robustness of alignment against data noise.

Abstract

Large language models (LLMs) have achieved remarkable success, yet aligning their generations with human preferences remains a critical challenge. Existing approaches to preference modeling often rely on an explicit or implicit reward function, overlooking the intricate and multifaceted nature of human preferences that may encompass conflicting factors across diverse tasks and populations. To address this limitation, we introduce Latent Preference Coding (LPC), a novel framework that models the implicit factors as well as their combinations behind holistic preferences using discrete latent codes. LPC seamlessly integrates with various offline alignment algorithms, automatically inferring the underlying factors and their importance from data without relying on pre-defined reward functions and hand-crafted combination weights. Extensive experiments on multiple benchmarks demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining

MethodsBalanced Selection