Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation
Sichun Luo, Guanzhi Deng, Jian Xu, Xiaojie Zhang, Hanxu Hou, Linqi Song

TL;DR
This paper evaluates large reasoning models for personalization, identifies key limitations, and proposes a novel framework with hierarchical reasoning and intervention methods to improve personalized generation performance.
Contribution
It introduces Reinforced Reasoning for Personalization ( extit{ReaPer}), a framework that enhances LRM personalization through structured reasoning templates and alignment techniques.
Findings
LRMs do not always outperform general LLMs in retrieval scenarios
The proposed extit{ReaPer} framework significantly improves personalization quality
Hierarchical reasoning and intervention methods enhance model alignment and consistency
Abstract
Personalization is a critical task in modern intelligent systems, with applications spanning diverse domains, including interactions with large language models (LLMs). Recent advances in reasoning capabilities have significantly enhanced LLMs, enabling unprecedented performance in tasks such as mathematics and coding. However, their potential for personalization tasks remains underexplored. In this paper, we present the first systematic evaluation of large reasoning models (LRMs) for personalization tasks. Surprisingly, despite generating more tokens, LRMs do not consistently outperform general-purpose LLMs, especially in retrieval-intensive scenarios where their advantages diminish. Our analysis identifies three key limitations: divergent thinking, misalignment of response formats, and ineffective use of retrieved information. To address these challenges, we propose Reinforced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
