Learning Transferable Latent User Preferences for Human-Aligned Decision Making

Alina Hyk; Sandhya Saisubramanian

arXiv:2605.12682·cs.AI·May 14, 2026

TL;DR

The paper introduces CLIPR (Conversational Learning for Inferring Preferences and Reasoning), a framework that learns transferable natural-language rules capturing latent user preferences from limited interactions, so that LLM-driven decisions align more closely with what users want.

Contribution

The paper proposes a method for inferring and applying latent user preferences from minimal conversational input, improving generalization and reducing inference costs.

Findings

1. CLIPR outperforms existing methods in alignment accuracy.

2. CLIPR reduces the number of interactions needed for preference inference.

3. CLIPR demonstrates effectiveness across multiple datasets and environments.

Abstract

Large language models (LLMs) are increasingly used as reasoning modules in many applications. While they are efficient in certain tasks, LLMs often struggle to produce human-aligned solutions. Human-aligned decision making requires accounting for both explicitly stated goals and latent user preferences that shape how ambiguous situations should be resolved. Existing approaches to incorporating such preferences either rely on extensive and repeated user interactions or fail to generalize latent preferences across tasks and contexts, limiting their practical applicability. We consider a setting in which an LLM is used for high-level reasoning and is responsible for inferring latent user preferences from limited interactions, which guides downstream decision making. We introduce CLIPR (Conversational Learning for Inferring Preferences and Reasoning), a framework that learns actionable,…
