Learning Transferable Latent User Preferences for Human-Aligned Decision Making
Alina Hyk, Sandhya Saisubramanian

TL;DR
The paper introduces CLIPR, a framework that learns transferable natural language rules to infer latent user preferences from minimal interactions, enhancing human-aligned decision making in LLMs.
Contribution
It proposes a novel method for inferring and applying latent user preferences using minimal conversational input, improving generalization and reducing inference costs.
Findings
CLIPR outperforms existing methods in alignment accuracy.
CLIPR reduces the number of interactions needed for preference inference.
CLIPR demonstrates effectiveness across multiple datasets and environments.
Abstract
Large language models (LLMs) are increasingly used as reasoning modules in many applications. While they are efficient in certain tasks, LLMs often struggle to produce human-aligned solutions. Human-aligned decision making requires accounting for both explicitly stated goals and latent user preferences that shape how ambiguous situations should be resolved. Existing approaches to incorporating such preferences either rely on extensive and repeated user interactions or fail to generalize latent preferences across tasks and contexts, limiting their practical applicability. We consider a setting in which an LLM is used for high-level reasoning and is responsible for inferring latent user preferences from limited interactions, which guides downstream decision making. We introduce CLIPR (Conversational Learning for Inferring Preferences and Reasoning), a framework that learns actionable,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
