User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Yuren Hao; Shuhaib Mehri; ChengXiang Zhai; Dilek Hakkani-T\"ur

arXiv:2603.20939·cs.CL·March 24, 2026

User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Yuren Hao, Shuhaib Mehri, ChengXiang Zhai, Dilek Hakkani-T\"ur

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper introduces VARS, a framework that personalizes conversational LLM agents by representing user preferences with vectors updated through weak feedback, improving interaction efficiency and interpretability without fine-tuning.

Contribution

VARS is a novel, pipeline-agnostic approach that models user preferences with long-term and short-term vectors, enabling effective personalization through online updates from weak scalar rewards.

Findings

01

VARS improves interaction efficiency over raw task accuracy.

02

The long-term vectors align with cross-user preferences.

03

Short-term vectors adapt to session-specific needs.

Abstract

Large language models are increasingly used as personal assistants, yet most lack a persistent user model, forcing users to repeatedly restate preferences across sessions. We propose Vector-Adapted Retrieval Scoring (VARS), a pipeline-agnostic, frozen-backbone framework that represents each user with long-term and short-term vectors in a shared preference space and uses these vectors to bias retrieval scoring over structured preference memory. The vectors are updated online from weak scalar rewards from users' feedback, enabling personalization without per-user fine-tuning. We evaluate on \textsc{MultiSessionCollab}, an online multi-session collaboration benchmark with rich user preference profiles, across math and code tasks. Under frozen backbones, the main benefit of user-aware retrieval is improved interaction efficiency rather than large gains in raw task accuracy: our full VARS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
blackhao0426/pref-extractor-qwen3-0.6b-full-sft
model· 126 dl
126 dl

Datasets

blackhao0426/user-preference-564k
dataset· 21 dl
21 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · AI in Service Interactions · Multimodal Machine Learning Applications