How Does Personalized Memory Shape LLM Behavior? Benchmarking Rational Preference Utilization in Personalized Assistants
Xueyang Feng, Weinan Gan, Xu Chen, Quanyu Dai, Yong Liu

TL;DR
This paper introduces RPEval, a benchmark for evaluating personalized memory in LLMs, revealing irrational personalization issues and proposing RP-Reasoner to improve memory utilization through pragmatic reasoning, enhancing user experience.
Contribution
The paper presents RPEval for assessing personalized memory in LLMs and introduces RP-Reasoner, a pragmatic reasoning approach that improves memory selection and reduces irrational personalization.
Findings
RPEval uncovers widespread irrational personalization in LLMs.
RP-Reasoner significantly outperforms baselines on RPEval.
The method resolves 80% of problematic cases in commercial assistants.
Abstract
Large language model (LLM)-powered assistants have recently integrated memory mechanisms that record user preferences, leading to more personalized and user-aligned responses. However, irrelevant personalized memories are often introduced into the context, interfering with the LLM's intent understanding. To comprehensively investigate the dual effects of personalization, we develop RPEval, a benchmark comprising a personalized intent reasoning dataset and a multi-granularity evaluation protocol. RPEval reveals the widespread phenomenon of irrational personalization in existing LLMs and, through error pattern analysis, illustrates its negative impact on user experience. Finally, we introduce RP-Reasoner, which treats memory utilization as a pragmatic reasoning process, enabling the selective integration of personalized information. Experimental results demonstrate that our method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Spreadsheets and End-User Computing
